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INTRODUCTION 

Immunoglobulin is the crux of the humoral immune response. 
As a cell surface receptor on B lymphocytes, immunoglobulin is 
responsible for instigating cellular processes as diverse as activa- 
tion, differentiation, and even programmed cell death. As secreted 
antibody in plasma and other bodily fluids, immunoglobulin is able 
to bind foreign antigen, thereby either neutralizing it directly or ini- 
tiating steps necessary to arm and recruit effector systems such as 
complement or antibody-dependent cell cytolysis by monocytic 
phagocytes. The ability of immunoglobulin to perform such a wide 
array of duties can be attributed to evolution's clever usage of a 
structural paradigm — the immunoglobulin domain — and its dupli- 
cation, diversification, and elaboration upon that design to endow 
it with an assortment of functional qualities. 

Despite the variety of purposes served by immunoglobulin mol- 
ecules, one feature remains common to virtually all considerations 
of immunoglobulin structure and function: immunoglobulins have 
an amazing capacity to interact with other molecules. In one sense, 
immunoglobulins must be able to effectively bind a finite set of 
invariant partners, such as Fc receptors, signal-transducing mole- 
cules, and components of the complement cascade. In another 
sense, immunoglobulins, collectively, must meet the challenge of 
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being able to recognize an essentially infinite array of antigenic 
determinants. More remarkable, perhaps, is the fact that immuno- 
globulin is frequently called upon to fulfill both of these binding 
responsibilities simultaneously, and in such a way as to mediate 
significant biological effects. As such, immunoglobulin molecules 
may be viewed as a marriage between the constraints engendered 
by biological continuity and the quest for diversity superimposed 
upon this evolutionary framework. 

The lengths to which evolution has gone in order to bestow 
immunoglobulin with these conflicting capabilities has been the 
subject of intense scientific scrutiny, and has yielded innumerable 
fascinating insights into immunology, genetics, protein chemistry, 
and the discipline of biology as a whole. In trying to understand how 
antibody is able to recognize such a multitude of different speci- 
ficities, science has benefited from the discovery of both VDJ 
recombination (see Chapter 5) and somatic hypermutation (see 
Chapter 25). In an attempt to reconcile the incongruity entailed by 
the observation of highly divergent N-terminal regions coupled to 
constant C-terminal domains, research has gained not only the 
once-heretical "two genes, one polypeptide" hypothesis (1), but also 
the concept of isotype switching (see Chapter 24). Thus, studies into 
immunoglobulin diversity have proven to be extremely profitable 
scientific endeavors. In addition, while diversity has been a hall- 
mark of the study of immunoglobulin since it was first recognized 
to be a salient feature, several aspects that derive from immunoglob- 
ulins * underlying uniformity have been used to glean understanding 
into protein structure-function relationships in general. 

Immunoglobulins were the first molecules described from the 
ancestral immunoglobulin superfamily (IgSF) (2-4). As an ever- 
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expanding gene family, members of the IgSF have been shown to 
be vital to issues of cell-cell interaction and molecular recognition 
in a variety of cell types and across several taxonomic boundaries. 
Many molecules central to the functioning of the immune system, 
including the antigen-specific chains of the T-cell receptor (TCR) 
(see Chapter 10) and the class I and II major histocompatibility 
complex (MHC) antigens (see Chapter 8), are counted among this 
group. Common to all members of the superfamily is the presence 
of one or more immunoglobulin-like domains. Three-dimensional 
structural analyses of proteins containing these regions have 
demonstrated that the conserved amino acid sequences that make 
up an immunoglobulin homology domain comprise a recurring 
structural motif that can fold into a compact globular subunit. 
These sub units, in turn, are capable of integrating into complex 
macromolecules (5,6). As a result, different immunoglobulin mol- 
ecular structures are similar not only to each other, but also to a 
multitude of other important proteins. 

Because such a concerted scientific effort has been made to 
understand the way in which immunoglobulin functions, a large 
volume of sequence information — at both the nucleotide and amino 
acid levels — is available in both the literature and public databases. 
Indeed, immunoglobulins have likely been sequenced more fre- 
quently than any other class of gene or protein. Similarly, 
immunoglobulins have been well represented in structural studies, 
crystallographic and otherwise, to an unprecedented degree. This 
mass of work has allowed a number of conclusions regarding struc- 
ture-function relationships of immunoglobulins to be made. 
Specifically, the aim of this chapter is not to compile an exhaustive 
catalogue of all extant work on the topic of immunoglobulin 
sequences, but rather to present the essential features of 
immunoglobulin structure and their relation to immunologic func- 
tion as is currently understood. Further, because immunoglobulin 
proteins have been so evolutionarily valuable, they can be found, in 
one form or another, throughout vertebrate species. Many of these 
molecules are only now being characterized, and surely many more 
are yet to be identified. As a consequence of this diversity, however, 
it is impossible to relate all of the details of immunoglobulin struc- 
ture and function in their entirety. Instead, unless otherwise noted, 
the examples of human and murine immunoglobulins will be used 
as models to convey the general conclusions garnered from scien- 
tific insight and experimentation into this important and fascinating 
class of proteins. The organization of this discussion will begin, fol- 
lowing an introduction to basic immunoglobulin features, with a 
consideration of the primary structure of antibody molecules and 
proceed through the secondary, tertiary, quaternary, and higher 
order immunoglobulin structural topics that derive from its 
sequence. Once this foundation has been laid, the functional attrib- 
utes of immunoglobulin will be considered, with an eye to correlat- 
ing an antibody's capacities — to the extent which it is possible — 
with that of its structure. A section on the IgSF follows, which will 
briefly address its evolution and also specifically detail particular 
IgSF members critical to immune responses that are not explicitly 
covered elsewhere in this volume. 

GENERAL IMMUNOGLOBULIN STRUCTURE, 
NOMENCLATURE, AND HISTORY 

Structural Considerations 

Figure 1 presents a diagrammatic representation of an antibody 
molecule. The typical immunoglobulin monomer is comprised of 




FIG. 1. Schematic representation of a prototypic immunoglobulin 
monomer. Each box symbolizes a complete immunoglobulin domain 
from either the heavy (shaded boxes) or light (unshaded) chain. 
Labeling of domains follows standard nomenclature, as outlined in 
the text. Interchain disulfide bonds are denoted by black bars. Note 
that these bonds are present between both heavy and light chain 
pairs and between the two heavy chains. Conserved N-linked car- 
bohydrate occurs on all C H 2 domains as shown, although some 
immunoglobulins are also glycosylated at additional sites elsewhere 
in the molecule. Also of note is the fact that all of the domains asso- 
ciate to form dimeric modules (WV L , C h 1/Cl, and Ch3/C h 3), except 
Ch2 domains. The Fab, Fc, and F(ab)' 2 proteolytic fragments are 
demarcated by bars to either side of the diagram. (From ref. 6a t with 
permission.) 



four polypeptide chains complexed together via hydrophobic 
interactions and stabilized by disulfide bonds. Due to allelic 
exclusion (see Chapters 5 and 6) B lymphocytes usually express 
only one functionally rearranged heavy chain gene and only one 
light chain polypeptide as well. Consequently, complete 
immunoglobulin proteins are composed of two identical heavy 
chain polypeptides of approximately 55 kD and two identical light 
chains of 25 kD. Each heavy and light chain pair is joined by one 
or more interchain disulfide bonds, and also relies upon non- 
covalent interactions to properly orient the two chains relative to 
each other. One such "half-antibody" contains a single antigen 
binding site (i.e., it is monovalent). The complete four polypeptide 
chain monomer is formed by similar hydrophobic bonding 
between the two heavy chains, and it also utilizes one or more 
disulfide bonds to stabilize the complex. Thus, a complete immun- 
oglobulin molecule is bivalent with two identical sites for poten- 
tial binding of antigen. As such, an immunoglobulin may be 
thought of as a "dimer of a heterodimer," although these half-mol- 
ecules do not occur naturally. 
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Each individual polypeptide chain consists of two to five do- 
mains of approximately 110 amino acids (7), each capable of 
folding independently. These domains form compact, protease- 
resistant structures which serve as the fundamental unit of immun- 
oglobulin structure. The interactions that allow for the formation of 
the aforementioned immunoglobulin monomer almost exclusively 
occur in pair- wise fashion between domains of two different 
polypeptide chains (see Fig. 1), such that the functional modules of 
an antibody are in fact dimerized domains. In addition, as each 
domain of an antibody molecule is encoded by a separate exon, 
immunoglobulin domains also serve as the essential element of 
antibody genetics. In this light, it is easy to recognize how evolu- 
tion has used the prototypical immunoglobulin domain as a sub- 
strate for experimentation, and as a result different domains have 
attained distinct structural and functional attributes. Moreover, the 
presence of one or more "immunoglobulin homology domains" 
also proves to be the distinguishing characteristic for inclusion in 
the immunoglobulin gene superfamily Thus, the duplication and 
adaptation of the Ig homology domain has occurred not only within 
the context of formal "immunoglobulin genes " but also in the 
greater scope of the IgSF, which far predates the emergence of anti- 
body. In either case, the archetypal immunoglobulin domain has 
clearly proven to be a powerful evolutionary tool, as will be 
detailed below and in greater detail throughout this chapter. 

The hallmark of all Ig domains is the presence of a structural 
motif termed the immunoglobulin fold. This characteristic feature 
is actually a specialized "p-barrel" typically comprised of seven 
polypeptide strands, which form antiparallel p-pleated sheets in the 
folded domain. This configuration is depicted in Fig. 2, which was 
deduced from x-ray diffraction studies of an immunoglobulin light 
chain (8). Each Ig domain is composed of two p-pleated sheets, 
one containing four p strands, the other consisting of at least three 
P strands (represented by arrows in Fig. 2). Loops of variable 
length connect the different strands, allowing the p sheets to form. 
The two P-pleated layers are oriented in a sandwich, enclosing a 
hydrophobic interior. Further stability is provided by a disulfide 
bond near the domain's core, which covalently links the two sheet 
layers. The cysteines that contribute this bond are conserved in all 
immunoglobulins, and in almost all proteins that possess Ig-like 
domains. Two residues, a tryptophan in strand 3-1 and an aromatic 
residue that precedes the second half-cystine, are also maintained 
consistently and serve to protect the disulfide bond in the three- 
dimensional structure. Other conserved features include hydropho- 



bic core residues, which stabilize the inside of the sandwich, and 
glycine and proline loop residues, which provide the flexibility 
necessary for the formation of these interconnecting sequences 
(9-12). 

Since the hydrophobic core residues are predominantly respon- 
sible for promoting the folding of the p sheets, and thus the entire 
immunoglobulin fold, the sequences of the loop residues are free to 
vary considerably. This, in turn, grants loop residues the freedom 
to serve as substrates for selection, at the level of selection of a par- 
ticular antibody in an immune response and at the level of natural 
selection in phylogeny. In this way, the prototypical immunoglobu- 
lin homology domain serves as a potent cofactor for the evolution 
of both organismal immunity and that of the species in general. 



Immunoglobulin Nomenclature 

Light chains contain two such immunoglobulin domains, 
whereas a heavy chain is made up of either four or five domains, 
depending on the type of heavy chain (isotype) used by the anti- 
body in question. Different immunoglobulin domains possess dif- 
ferent structural and functional characteristics, and their naming, in 
part, reflects these differences. The ammo-tenninal domain of each 
chain, whether of the heavy or light type, is termed a variable (V) 
region due to the discovery of extensive sequence divergence 
between different antibody proteins in this part of the molecule. 
These are designated Vh and Vl for heavy and light chains, respec- 
tively. V regions have been demonstrated to be responsible for the 
antigenic specificity of the immunoglobulin. 

Carboxy-terminal domains, on the other hand, display consider- 
ably less sequence variation within a given isotype and are referred 
to as constant (C) regions. Heavy chain C regions are numbered 
ChI, Ch2, and so on, beginning with the most V region-proximal 
domain. The constant region domains of the heavy chain have been 
shown to be responsible for many aspects of antibody function, 
including interaction with Fc receptors, complement fixation, 
transplacental transfer, the ability to multimerize, and the capacity 
to be secreted on mucosal surfaces. Because different heavy chain 
isotypes have different C region domains (i.e., the Ch3 domains of 
different isotypes are distinct), these capabilities vary with the 
class of the particular antibody. Five major classes of heavy chain 
C regions exist: alpha (a), gamma (y), delta (5), epsilon (e), and 
mu (u.). As a direct consequence of the correlation between the 




FIG. 2. Ribbon drawing of the V and C domains of a light 
chain, p strands are depicted as arrows, with those of the 
four-stranded face unshaded and those of the three- 
stranded face shaded. Strands are numbered according 
to Edmundson and lettered {in parentheses) according to 
Hood. Intrachain disulfide bonds are represented as black 
bars. Selected amino acids are numbered, with position 1 
being the N-terminus. Residues 26, 53, and 96 corre- 
spond to amino acids in CDRs 1 , 2, and 3 t respectively. 
The dimerization surfaces of each domain (four-strand 
side of the C domain, three-strand side of the V domain) 
face upwards. (Adapted from ref 8, with permission.) 
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heavy chain class of an antibody and its resultant effector func- 
tions, immunoglobulins are named according to their heavy chain, 
using an English-letter terminology (IgA, IgG, IgD, IgE, IgM), 
which corresponds to their Greek letter isotypes. Specific domains 
of C regions are often designated according to the class of heavy 
chain from whence they originate as well (i.e., the Ch3 domain of 
a |i antibody is signified by Q3). Owing to the propensity of 
immunoglobulin domains to evolve independent of one another, 
oftentimes a particular domain of a specific isotype may be respon- 
sible for one or more functional characteristics of the entire anti- 
body, making this naming system particularly relevant. On the 
other hand, the constant regions of light chains, possessing only 
one C domain, are usually simply denoted by Cl. The two light 
chain classes, kappa (k) and lambda (X), may be indicated by the 
use of Cic or Cx designations. No distinct functional attributes have 
as yet been ascribed to either the k or X light chain isotypes. 

Finally, immunoglobulins also have hinge regions located C-ter- 
minal to the Ch 1 domains of their heavy chains. In the case of heavy 
chains of the ji and e isotypes, the hinge is so elongated that it is 
actually an extra immunoglobulin domain, explaining the presence 
of a fifth C domain in these molecules. Other heavy chain classes 
use shorter stretches of protein, which are thought nonetheless to 
have evolved from the Cn/ £ 2 domain. Consistent with the indepen- 
dent evolution of the other domains of immunoglobulin genes, 
hinge regions are generally encoded by individual exons as well. As 
the name implies, the hinge permits a generous degree of flexibility 
between the antigen-binding and effector-interacting components of 
the molecule. Thus, the hinge domain facilitates linking of the two 
disparate elements of immunoglobulin function: the ability to inter- 
act with an endless array of structural determinants on antigenic 
surfaces (mediated by V regions) and the capacity to interact with a 
limited number of effector-activating molecules (mediated by C 
regions). In addition, disulfide bridges between the two heavy 
chains typically occur within the boundaries of the hinge region, 
allowing the complete tetrameric complex to form. 

Hence, in many cases, the discrete elements of immunoglobulin 
structure defined both genetically and structurally as immunoglob- 
ulin domains are also responsible for specific functional qualities. 
Moreover, in addition to the one-domain-per-exon correlation that 
exists for immunoglobulins, in both heavy and light chains the V 
region domain and the domains of the different C regions are in 
fact distinct genes (13). This type of genetic arrangement allows 
the ability to recognize a specific antigen to be united with the 
effector functions that are most appropriate for that particular 
immune response at that particular time. In this regard, then, it is 
clear that antibodies truly embody the linkage of structure to that 
of function. 



An Historical Perspective 

Long before x-ray diffraction of crystals had yielded the keys to 
dissecting the structure of immunoglobulin, other seminal studies 
had been performed that, in retrospect, agree completely with the 
conclusions drawn from crystallization analysis. Many of these 
experiments focused on the basic protein chemistry of pooled IgG, 
using the techniques of proteolysis, reduction, and denaturation. 
First, it was revealed that papain digestion of IgG would render two 
types of protein fragments: Fab, a monovalent antigen binding 
fragment, and Fc, an easily crystallizable fragment (14). Soon 
after, it was recognized that pepsin treatment of IgG produced an 



antigen-binding fragment designated F(ab)' 2 which had bivalent 
activity (15). Furthermore, if this fragment was treated with reduc- 
ing agents, two univalent Fab' fragments could be obtained. These 
different fragments are schematically represented in Fig. I. Reduc- 
tion and dissociation of IgG also demonstrated that identical heavy 
and light chains were complexed via disulfide bonds (16). This and 
other work eventually allowed investigators to decipher a working 
model for immunoglobulin structure consisting of four polypeptide 
subunits — two identical heavy chains and two identical light 
chains — stabilized by multiple interchain disulfide bonds (17,18), 
which we now know to be correct. 

Early studies of a different type also proved successful in reveal- 
ing information about antibodies; these experiments focused upon 
utilizing the immune system itself as a means to decode aspects of 
immunoglobulin structure (reviewed in refs. 19-21). As large gly- 
coproteins, antibodies themselves are potent immunogens capable 
of eliciting vigorous humoral immune responses. Investigators 
used immunoglobulin preparations (initially either heterogeneous 
total serum immunoglobulin or homogeneous myeloma or plasma- 
cytoma proteins, and later monoclonal antibodies from hybrido- 
mas) as antigens to generate antibody responses by injecting them 
into animals of differing species or different animals of the same 
species. The antibodies produced by these immunization protocols 
proved useful in resolving several key elements of immunoglobu- 
lin structure, and many of the antigenic determinants recognized by 
these antisera have subsequently been shown to correlate exactly 
with known structural features of immunoglobulins. 

A three-tiered serological classification scheme for immuno- 
globulin was devised using these antisera (after adsorption) as 
reagents to categorize antibody molecules into distinct groups. The 
first tier of organization is that of the isotype. Isotypes define C 
region determinants and, as such, distinguish heavy and light chain 
constant regions from one another. Initially, five heavy chain 
classes were recognized and given the Greek letter designations 
mentioned in the previous section. The presence of these five iso- 
types in virtually all mammals for which immunoglobulin profiles 
have been determined indicates that the divergence of C region 
genes occurred at an early stage of mammalian evolution. Simi- 
larly, light chain constant regions were also divided into discrete K 
and X classes. Soon after, refinements made clear that two of the 
human heavy chain classes, a and y, in fact contained several 
related members that could be further divided into subclasses. 
Human IgA is separated into ctl and a2 subclasses, and human 
IgG is separated into four y subclasses: yl, y2, y3, and y4. Murine 
IgG is also composed of four y subclasses (yl, y2a, ?2b, and y3), 
although their structural and functional characteristics — and their 
abbreviated designations — do not agree with their human counter- 
parts, thus indicating this diversification occurred after these 
species* evolutionary divergence. Each different isotype (whether 
class or subclass) is represented by a separate C region gene in the 
haploid genome, and all isotypes are present in the sera of all nor- 
mal individuals of a given species. 

Allotypes, on the other hand, refer to determinants found on the 
antibodies of some, but not all, members of a species. These deter- 
minants are encoded by one allele of a particular C region gene 
(either heavy or light chain) and are inherited in typical Mendelian 
fashion as autosomal dominant traits. Compilations of human allo- 
types are summarized in ref. 22 and are covered more extensively 
in ref. 23. Whereas isotypes and allotypes are localized to the C 
regions of immunoglobulins, idiotypes are antigenic determinants 
found on the V regions of antibodies, and they frequently correlate 
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with binding specificity. Generally, idiotypes are present only in an 
individual member of a given species, and these antigenic epitopes 
reflect the uniqueness of each individual immunoglobulin mole- 
cule. An idiotypic determinant defined by a monoclonal antibody 
is called an idiotope, Idiotypes are not always restricted to the indi- 
vidual, however. Occasionally, when two individuals are chal- 
lenged with the same antigen, they will produce antibodies that 
share the same idiotypic determinant s). In such cases, the idiotype 
is called a cross-reactive, or public, idiotype (24). Cross-reactive 
idiotypes represent the usage of the same V gene segment by dif- 
ferent individuals. Thus, idiotypes may be best thought of as being 
restricted not to the individual organism, but to the individual 
immunoglobulin molecule. 

Obviously, a tremendous amount of effort, using a variety of sci- 
entific approaches, has been focused upon attempts to understand 
immunoglobulin structure. It is remarkable, though, that the vast 
majority of this early work, whether utilizing protein chemistry to 
resolve basic structural characteristics or manipulating the humoral 
immune response to generate reagents to classify immunoglobulin 
proteins relative to one another, has in fact identified, and in many 
cases answered, many of the crucial questions of immunoglobulin 
structure correctly. As shall be discussed in the following sections, 
many of the crucial structural features of the antibody — from pri- 
mary sequence to quaternary associations — were first inferred 
from these initial landmark studies. 

IMMUNOGLOBULIN STRUCTURE 

Primary Structure — Two Genes, One Polypeptide 

The assertion that each immunoglobulin chain derived from two 
distinct genetic entities was a novel and provocative hypothesis at 
the time it was proposed (1), and it has proven to be correct. Owing 
to this genetic independence, the structures of the V and C regions 
of immunoglobulin will be treated separately here as well, although 
they obviously share numerous commonalties. 

Among the most remarkable discrepancies between V and C 
regions are the differences in their genetic organization. While the 
different heavy and light chain C regions are encoded by fewer than 
20 genes, V region genes (V H , V K , and Vx) number in the hundreds. 
Further distinguishing the V region loci is the fact that genes for 
complete V H or Vl domains are not present in the genome origi- 
nally, but are "recombined" at the genetic level according to the 
processes of somatic diversification described in Chapter 5. This 
recombination of V, D, and J elements to form functional heavy 
chain genes (or V and J in the case of light chains) imposes another 
degree of diversity upon the V regions. Due to these complexities, 
sequence variability is in fact the hallmark of variable region 
domains. In addition to the differences between V and C domains 
in their genetic design and construction, V region sequences also 
have uniquely identifiable characteristics. One of the most easily 
recognizable features is the fact that V regions are approximately 1 6 
residues longer than the prototypic 1 10 amino acid immunoglobulin 
domain. These extra residues allow V regions to form a distinctive 
immunoglobulin fold structure using two additional p strands, 
which distinguishes V domains from C domains and also has impli- 
cations for V region function. Also, the processes of V(D)J recom- 
bination can further alter the germline-encoded length of V regions, 
subtly affecting V domain structure and function as well. 

V regions form the amino-terminal domains of heavy and light 
chains, and their primary responsibility is the binding of antigen. 



The promotion of the capacity to recognize antigenic determinants 
has been the driving force behind the evolution of V genes, both at 
the structural level of individual genes and for the evolution of the 
different V loci. When sequence data first became available from 
antibody proteins (see Fig. 3), it was apparent that great variation 
existed between V regions relative to that found between C regions. 
A means was developed to quantitate this variation whereby vari- 
ability was defined as the number of different amino acids 
observed at a given position divided by the frequency of the most 
common amino acid at that position (25). Using this equation, an 
invariant residue would have a variability equal to one, whereas the 
theoretical upper limit for a position occupied by each of the 20 
amino acids in a random fashion would be 400. This can be illus- 
trated graphically by plotting the variability scores of a particular 
protein against its residue number, as is demonstrated in Fig. 4. 
Variability plots of this type established not only that V regions 
were characterized by diversity in their sequences, but that this 
variation was principally clustered in three regions, which were 
deemed hypervariable regions (HVRs). It was hypothesized that 
these highly variable segments of heavy and light chains would 
coordinate in such a way as to form the antigen-combining site 



LIGHT CHAIN VARIABLE REGIONS 

Ag: DIQFTruSPSSLSASVGDRVTITCQASQ- A -- 2 - 7 --DINHYL 
Len : ** V*****NS*AV*L*E*A**N*KS**SVLYSSNSKN** 
Ti: E*VL****GT**L*P*E*A*LS*R***S— — VSNSF* 



Ag: NWYQQGPKKAPKILIYDASNLETGVPSRFSGSGFGTDFT 
Len : A****K*GOP**L***W**TR*S***D******S***** 
Ti : A****K*GG**RL***V**SRA**I*D******S***** 



Ag: FTISGLQPEDIATYYCQQYDTLPRTFGQGTKLEIKRT 
Len : L***S**A**V*V******YST*YS***********T 
Ti : L***R*E***F*V******(iSS*S*******V*L**T 



KAPPA LIGHT CHAIN CONSTANT REGIONS 
108 

Ag : TVAAPSVFI FPPSNEOLKSGTASVVCLLNNFYPREA 
Len: ************************************ 
Ti : *************D********************** 



Ag : KVQWKVDNALQSGNSQES VTEODSKDSTYSLSSTLT 
Len: ************************************ 
Ti: ************************************ 



Ag: LSKADYEKHKVYACEVTHQGLSSPVTKSFNRGEC 
Len: **********************************- 
Jx ; ********************************** 

FIG. 3. Amino acid sequences of human antibody light chains. 
Dashes denote gaps introduced to optimally align sequences; aster- 
isks represent identity relative to the top sequence. These are 
among the first immunoglobulin sequences ever obtained, demon- 
strating the dramatic differences tn variation between V and C 
regions. The appearance of clusters of conserved residues within 
the different V domains also illustrates the necessity of a system to 
accurately quantify variations between several sequences (see Fig. 
4). (Developed from the sequence compendium of Kabat et al. in ret. 
24a.) 
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(reviewed in ref. 26); thus they were termed complementarity- 
determining regions (CDRs) as well. 

Variability analysis also determined that other stretches of V 
region sequence were reasonably well conserved from protein to 
protein; these were presumed to perform basic structural functions 
necessary for proper folding of all V domains. Accordingly, they 
were dubbed framework regions (FRs) because they provide the 
platform that supports the CDRs. Structural analyses have con- 
firmed that the FRs largely coincide with the P strands of the 
immunoglobulin fold, while CDRs, on the other hand, chiefly cor- 
respond to the loops that join p strands on the C region-distal end 
of the V domain. A linear representation of this association is 
shown in Fig. 5; note that CDRs 1 , 2, and 3 join P strands B and C, 
C and D, and F and G, respectively. Significant sequence motifs are 
also apparent in this comparison of six V region proteins: a W7F- 
G-X-G motif in FR4 that is common to all V domains (27), the V H - 
specific G-L-E-W-hydrophobic stretch in FR2 (28), and the V L - 
specific sequence P-hydrophilic-hydrophobic-L-hydrophobic in 
the analogous FR2 location (28). These motifs are vital for proper 
dimerization of domains and will be discussed further in the sec- 
tion on quaternary immunoglobulin structure. Another important 
distinction between heavy and light chain V region sequences is 
also apparent in Fig. 5: relative to V L domains, V H regions gener- 
ally utilize longer fRl and CDR2 segments and shorter CDR1 and 
FR2 stretches. While the vast number of V genes precludes the 
ability to definitively assign boundaries for FRs and CDRs that are 
constant among all immunoglobulin V regions, Table 1 summarizes 
the traditional positions that delineate these areas for both heavy 
and light chains. 

The presence of hundreds of different germline V region genes 
obviously contributes greatly to the sequence diversity of different 
variable domains. However, the somatic process of V(D)J recombi- 
nation (see Chapter 5) further accentuates V region variability, 
specifically targeting the CDR3 of the protein (29). In this system! 
approximately 100 unique V genes (V H , V K) or V x loci) encode the 



N-terminal FR1-CDR1-FR2-CDR2-FR3-5'CDR3 portions of \ 
regions, while four to six "joining" (J) minigenes code for the car- 
boxy-terminal 3'CDR3-FR4 segments. Heavy chains also incorpo- 
rate one of about 30 short "diversity" (D) gene segments betweer 
V and J genes to generate complete V region domains. The rela- 
tionship between rearranged V(D)J gene segments and the 
FR/CDR organization of the V region is schematically represented 
in Fig. 6. The combinatorial assortment of gene segments to form 
complete heavy and light chains, followed by the combinatorial 
assortment of heavy and light chains with each other to form anti- 
gen-binding Vh/Vl dimers, results in a practically limitless number 
of V domain structures. Moreover, during the recombination 
process itself, the activity of exonucleases and untemplated N- 
segment additions (29), templated P-nucleotide incorporation (30), 
and D-D fusion events (31) can boost the diversity of CDR3 even 
further. Finally, superimposed on these aspects of "combinatorial" 
and "junctional" diversity-generating mechanisms, somatic hyper- 
mutation (32,33; see Chapter 25) serves to introduce still more 
variation by altering residues throughout the V region. 

Despite— and perhaps as a result of— the seemingly endless 
number of possible V region sequences, sophisticated schema have 
emerged for their classification. These groupings are based upon 
homology-based hierarchies that directly reflect the evolution of 
the antibody gene loci. Members of a group are more similar to 
each other than to all other sequences from other groups and share 
linked amino acid substitution patterns, which serve as "identi- 
fiers" for the various classifications. The most evolutionary distant 
grouping is, of course, that of the V regions themselves, followed 
by the V H , V*, and V\ distinctions, which represent the separate V 
gene loci. In humans, the heavy chain locus is found on chromo- 
some 14, and the k and X loci are found on chromosomes 2 and 22, 
respectively (34-36). In the mouse, these genes are located on 
chromosomes 12, 6, and 16 (37-39). Other stratifications for V 
region organization also mirror the evolution of the antibody gene 
loci. The use of "clans" to categorize V genes has demonstrated the 
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TABLE 1. 


Boundaries delineating the statistical and structural 




subdomains of variable regions 
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Chain 


positions 


structural loop 


FR1 


Heavy 


1-30 






Light 


t-23 


H1: 26-32 


CDR1 


Heavy 


31-35* 




Light 


24-34* 


L1: 26-33 


FR2 


Heavy 


36-49 






Light 


35-49 




CDR2 


Heavy 


50-65* 


H2: 53-55 




Light 


50-56 


L2: 50-52 


FR3 


Heavy 


66-94* 






Light 


57-88 


H3: 96-101 


CDR3 


Heavy 


95-102* 




Light 


89-97* 


L3: 91-96 


FR4 


Heavy 


103-113 






Light 


98-107* 





FR1— 4 (framework regions), CDR1-3 (complementarity-determin- 
ing regions), H1-3 (heavy chain variable loops), and L1-3 (light chain ■ 
variable loops) are numbered according to Kabat et al. (1) using an 
alignment giving priority to conserved residues. Asterisks indicate 
regions which may have length variations depending upon germline 
V gene usage and/or junctional diversity. Data for the table were 
compiled from Kabat et al. (24a) and Chothia and Lesk (28a). 




FIG 6. Comparison of the gene structures and protein subdomains 
of rearranged V regions. (A) Heavy chain V domain. (B) Light chain 
V domain. The V H and V L gene segments are represented as 
hatched boxes, the D H gene segment as a white box, and the Jh and 
J. qene segments as black boxes. Framework regions (FRs) are dis- 
played as shaded boxes and complementarity-determining regions 
(CDRs) are pictured as checkered boxes. An approximate amino 
acid scale separates the two diagrams. 
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development of V loci across several vertebrate species (see Fig. 7) 
for both heavy and light chains (40-44). Three Vh clans have been 
recognized using nucleotide sequence homology comparisons 
across the FR1 6-24 codon interval. While this stretch of FR1 
sequence is conserved within a clan, a similar span (the 67-85 
codon interval of FR3) can also be used to discriminate between Vh 
genes that belong to the same clan but differ in regard to the next 
level of classification, that of the family. 

Classically, families are the groupings that have been used most 
frequently to identify and categorize V region genes relative to one 
another (reviewed in refs. 45—47). Members of a V region family 
share about 80% identity at the DNA level. Historically, when 
genomic Southern blotting was used to work out approximate fam- 
ily sizes, this degree of homology allowed for sufficient cross- 
hybridization to occur under low-stringency conditions, accounting 
for the utility of the family designation. At* the protein level, this 
translates to levels of about 75% identity between gene products 
from the same family and less than 70% homology for proteins 
belonging to different families. Using these criteria, human Vh 
genes may be segregated into seven families, V K into four major 
families, and Vx into ten families. The murine system (which con- 
tains larger absolute numbers of V genes) is more complicated, as 
evidenced by the fact that 14 Vh and 20 V K families have been rec- 
ognized. Example sequences from several human V K and V H gene 
families are aligned in Fig. 8. Note the presence of numerous 
."shared substitutions" within pairs of sequences belonging to the 
same family that are not present in the other sequences. These 
serve as distinctive "signature residues" which facilitate rapid 
identification of a particular sequence's "family of origin." The 
sequences in Fig. 8 also demonstrate another important character- 
istic of V gene families: Different families frequently possess dif- 
ferent CDR lengths. Thus, independent of amino acid sequence, V 
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region families intrinsically possess differing binding-site strut 
tures, thereby affecting their functional capabilities. 

Shared substitutions have also been used to further refine fam 
lies into subfamilies. Subfamilies, as are all classification schema i 
some extent, are particularly arbitrary divisions, largely because tl 
parameters used to define subfamilies are not standardized. Gene 
ally, the features that describe a particular subfamily are esoter 
and depend upon the specific characteristic(s) being studie 
Finally, at the most descriptive level of classification, the work < 
Rabbitts, Winter, Honjo, and many others (46-50) has resulted : 
the identification, mapping, and — to some extent — sequencing < 
presumably most, if not all, human V region genes. Within a famil 
single V gene segments may be compared against a consensi 
sequence representing that family (Fig. 9). When such a comparisc 
is performed, it becomes clear that individual V genes' divergent 
is focused primarily in their CDR1 and CDR2 segments (42). Thu 
on the basis of primary sequence structural information, eve 
closely related V genes may be predicted to adopt similar fram 
work cores but differ in terms of their CDR1 and CDR2 loops sue 
that a plethora of potential antigenic specificities are encode 
genomically. In fact, even at the level of the individual V gene vai 
ability persists in this system. As one might expect with "variabU 
genes, allelic variation and polymorphism of the antibody lo 
exists as well — such that it is probably safe to conclude that all * 
the possible incarnations of immunoglobulin V genes will nev« 
truly be identified and categorized. Suffice to say, then, that tl 
variable gene loci, clans, families, subfamilies, and even individu 
V genes themselves all derive from gene duplication events; on 
the period in evolutionary time at which the duplication occurrc 
truly separates one gene or group of genes from any another. 

While V regions provide the surfaces that interact with "foreigr 
antigenic determinants, constant regions perform the function « 



Ancestral 
Clan III 
Vh Gene 



Clan I 



Clan III 

i 



Clan II 
I 



Vh5 
Vr9 



Xenopus Mouse 



J558 Va 1 
Va>3.8 V B 5 
Vh7 



Vh1 
Shark 



VhI 



VhI 

Chicken 



Val 



Human 



Xenopus 



Rabbit 



7183 
T15 
J606 
X24 
VhIO 
VhII 
Vh13 
Mouse 



Vh3 
Human 



Vh2 36-60 Vb4 
Vh3 Q52 Vb6 
Vh4 Vb12 Hxi man 

V"5 House 
Vh7 

Xenopus 



3609 Vh2 
House Huma 
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binding to the "self" molecules that mediate physiologic and 
immunologic effector pathways. Light chains have one constant 
region domain (either C K - or C\) C-terminal to the V L . The C L pairs 
with the first constant region domain of the heavy chain (ChI) 
using hydrophobic and disulfide interactions to stabilize heavy and 
light chain coupling. Cl is not known to specifically bind any other 
biological moities; therefore, no known effector properties are 
attributable to light chain C regions. By contrast, all known qua- 
ternary associations, biologic characteristics, and physiologic func- 
tions of immunoglobulins are governed by heavy chain C regions. 
Depending upon antibody isotype, heavy chain C regions consist of 
either three or four Ch domains. The first domain (ChI) mediates 
association with Cl as detailed above and is part of the Fab frag- 
ment, while the following two (or three) C-terminal domains par- 
ticipate in interchain binding between heavy chain molecules and 
collectively comprise the Fc. Connection between ChI domains of 
|i and e antibodies and their respective Fc is accomplished by spe- 
cialized Ch domains (C^2 or C e 2) that allow for flexibility of the 
Fabs, yet maintain features of other Ig constant domains. Antibod- 
ies of the y, a, and 5 classes, however, utilize a shorter flexible 
hinge segment for this purpose, which will be specifically detailed 
elsewhere. 

While initial studies indicated that C region sequences were rea- 
sonably highly conserved — at least relative to V regions (refer to 
Fig. 3) — their designation as "constant" is actually somewhat of a 
misnomer. For instance, studies comparing V and C regions from 
primitive species have suggested that the lengths of the loops con- 
necting the p strands of constant regions actually vary more than do 
those of variable regions (51). Furthermore, between different iso- 
types of the same species, Ch regions share only about 30% 
sequence identity overall (Fig. 10). C region domains differ in 



regard to their interclass homologies such that different domains 
show various levels of sequence conservation. ChI domains are the 
most similar between isotypes; this may derive from the fact that all 
share the common function of pairing with immunoglobulin light 
chains. Similarly, the carboxy-terminal domains of \i, a, and Y(Qi4, 
C a 3, and 0,3) are substantially more related than the average for 
their Fc as a whole. As x-ray studies have indicated close contact 
exists between the C-terminal domains of the Fc, this sequence 
conservation may result from similar constraints as for the ChI/Cl 
situation. Moreover, in the case of the \i and a chains, the relatively 
higher homology likely reflects the common role of the last domain 
in multimer formation. Also note that the Cji4 and 0*3 domains 
each possess an 18-amino acid "tailpiece" at their C-termini. The 
penultimate cysteine residue in each sequence contributes to an 
intersubunit disulfide bond, allowing IgM and IgA polymerization 
(52). Much like the V region paradigm, a hierarchy of shared sub- 
stitutions can be used to distinguish sequences of different 
domains, classes, and subclasses. In all comparisons between iso- 
types, however, it is apparent that the majority of conserved 
residues are localized to the (3 strands. As was the situation with the 
V region FRs, these strands are responsible for the proper folding 
of the domain. In addition, the two cysteines that form the intra- 
chain disulfide bond, and the tryptophan which protects it from sol- 
vent reduction, are preserved among all Ch sequences as well. 

In analogous fashion to V region evolution, the five heavy chain 
isotypes likely arose by duplication and diversification of a com- 
mon gene precursor. This probably occurred in an ancestral organ- 
ism that preceded mammalian speciation, because examples of all 
five classes appear in all mammals. Actually, interspecies sequence 
comparisons of a given isotype demonstrate their similarity far 
exceeds that of the different isotypes within a species (compare 
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Figs. 10A and 1 1). Subclasses represent more recent gene duplica- 
tion events. Accordingly, distinct subclass profiles exist in various 
mammalian species. For instance, while humans have two a loci 
and hence two IgA subclasses (53), murine species have only one 
a gene, and rabbits have 13 (54)! Due to their later evolutionary 
divergence, heavy chain subclasses display greater sequence simi- 
larity — on the order of 60% to 90% — as evidenced by the y\ and 
y3 alignments depicted in Fig. 10. Despite the high levels of con- 
cordance between C regions of related subclasses, even slight dif- 
ferences can have profound functional repercussions. As an exam- 
ple, among the four human IgG subclasses (whose sequences are 



over 95% identical to one another), IgGl and IgG3 bind to 
macrophages and other phagocytes with ease, while IgG2 and 
IgG4 bind very poorly. This binding is mediated by an Fey recep- 
tor that has been extensively characterized (detailed later within 
this chapter). Similar subtle sequence differences are also involved 
in functional properties affecting immunoglobulin catabolism, pla- 
cental transfer, and reactivity with staphylococcal protein A (SPA). 
It is this selective pressure for the ability to perform in a variety of 
functional capacities that has both- maintained the five major Ch 
classes throughout mammalian species and also driven the evolu- 
tion of subclasses to their points of divergence in these same 
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FIG. 11. Protein sequence alignments of the C^1 domains from four mammalian species. Labels are as for Fig. 10. 
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relative to the alignments of the different human Ch1 isotypes in Fig. 10A. 



species. Collectively, these evolutionary changes have bestowed 
upon the different isotypes the ability to respond to antigenic chal- 
lenge in a variety of immunologically productive ways. 

Despite the absence of evidence to suggest that light chain C 
regions participate in biologically significant interactions (other 
than coupling with heavy chains), multiple forms of Cl have also 
been identified. First, the light chain isotypes C K and C\ — while 
functionally indistinguishable — exist as separate genetic loci with 
their own complement of possible V and J gene segment partners. 
This is in contrast to the situation with the heavy chain locus, in 
which any rearranged VDJ can potentially become associated with 
any Ch isotype via the process of class switching (see Chapter 24). 
While the use of k versus X isotypes seems to be inconsequential 
as to the antibody's efficacy, differences in their utilization are 
nonetheless present. In human immunoglobulins the k:X ratio is 
approximately 70:30, while in murine systems about 95% of anti- 
body is of the k class (20). The reasons for these imbalances are yet 
to be definitively explained, but are probably related to the number 
of V K and Vx genes available for use in the respective genomes of 
these organisms. As was the case for heavy chain C regions, light 
chain isotypes are well conserved across species (Fig. 12). In addi- 
tion, within a species, k and X classes are more similar to each 
other (about 38%) than were the different Ch isotypes (compare 
Figs. 10 and 12). Note also the presence of a terminal (C K ) or 
penultimate (Cx) cysteine residue in these sequences. This half- 
cystine is responsible for the light chain's contribution to the H-L 
interchain disulfide bond. A second point of variation within Cl 
regions concerns C\ subclasses. While only a single C K gene is pre- 
sent in human and mouse, five or four A,-chain subclasses exist in 
human or murine genomes, respectively. A final deviation from 
"constancy" involves the C K domain. While only one k constant 
region gene is found in the locus, three human K allotypes have 
been identified. These alleles differ in regard to their residues at the 
surface-exposed positions 153 and 191, such that these allotypic 
markers are able to serve as antigenic determinants (55). 

As mentioned previously, hinges — either as distinct domains in 
the cases of the \jl and e chains or as shorter specialized segments 
for a, y, and 8 antibodies— connect the Fab and Fc portions of the 



immunoglobulin molecule. However, the lack of extensive hinge 
structure in nonmammalian heavy chain sequences indicates that 
the evolution of this function largely occurred after mammalian 
radiation (51). Consequently, discussion here is restricted to mam- 
malian hinges as typified by human sequences. 

Hinge regions not only connect Fab to Fc, but also contain the 
disulfide bonds that covalently link the two heavy chains (dis- 
cussed further in the section on quaternary structure) in the Fc por- 
tion of the antibody molecule. They display great variability 
between isotypes and are generally encoded by unique exons. For 
example, the hinges of human IgGl, IgG2, and IgG4 are each pro- 
duced from a single short exon encoding a peptide of between 1 2 
and 15 amino acids. Alternatively, the IgG3 hinge derives from 
four distinct exons, resulting in a hinge region that spans approxi- 
mately four times as many residues as the other y isotypes (56). 
Owing to similarities between the different hinge sequences and 
the extra C>i and C e domain sequences, it is thought that hinges 
evolved from the 0^2 domain; unfortunately the hinge sequences 
are too short and the homologies too weak to trace hinge lineage 
with certainty. It is clear from sequence comparisons, however, that 
the Cy3 hinge evolved from the Cyl hinge by multiple duplication 
(further substantiated by the aforementioned Cyl and Cy3 hinge 
exon arrangements). Figure 13 aligns the extra domains of Qi and 
C e , in addition to the hinges of several other human isotypes. 

In addition to the variable, constant, and hinge regions of anti- 
bodies, other primary sequence features must also be considered. 
For instance, both heavy and light chains of immunoglobulin are 
synthesized with a leader peptide (almost entirely encoded by a 
separate exon upstream of the V region) of 16 to 26 amino acids in 
length. During translation, this leader (more generally referred to 
as a signal peptide) directs the mRNA/ribosome/polypeptide com- 
plex to the rough endoplasmic reticulum (RER) where translation 
is completed. During the synthesis and extrusion of the nascent 
polypeptide through the RER membrane, the signal peptide is then 
removed by specific proteolytic cleavage. 

Finally, for a comprehensive discussion of primary immunoglob- 
ulin structure, mention must be made of sequences present exclu- 
sively in the case of surface immunoglobulin expression. Heavy 
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FIG. 12. Comparison of the C« and Cx domains from three mammalian species. Labeling is as for Figs. 10 and 11, 
except vertical lines now represent amino acids shared among all six sequences and asterisks specifically denote 
the invariant tryptophan and cysteines at the core of the domain (also shared by all sequences). As in Fig. 11. little 
gapping is needed and high homology persists across species. 
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chains of all isotypes can exist either as secreted antibody or as 
membrane-bound immunoglobulin (mlg), which serves as the cen- 
tral component of the antigen-specific B cell receptor (BCR). The 
choice between mlg and secreted immunoglobulin is manifest at 
the level of alternative mRNA splicing at the 3' end of the message. 
Differential processing in favor of the surface immunoglobulin 
form results in the replacement of the hydrophilic carboxy-termi- 
nal residues of secreted antibody with a stretch of hydrophobic 
amino acids (which anchors the immunoglobulin in the cell mem- 
brane) and a short cytoplasmic tail (57,58). Expression of these 
sequences not only tethers the immunoglobulin to the cell surface, 
but also governs the ability of mlg to interact with other con- 
stituents of the BCR necessary for signal propagation and eventual 
activation by antigen (see Chapter 7). As such, the regulation of 
this splicing event and its ultimate protein products is tightly con- 
trolled through the stages of B lymphocyte differentiation so as to 
ensure the proper production of membrane immunoglobulin versus 
secreted antibody at the appropriate developmental stage of the B 
cell (see Chapters 5 and 6). 

Secondary Structure — The Immunoglobulin Fold 

With the exceptions of the hinge and cytoplasmic tail, the prop- 
erties of the immunoglobulin fold as a protein motif dominate all 
aspects of immunoglobulin structure, from primary to quaternary. 
In regard to secondary structure, this refers to the different patterns 
of p-pleated sheet formation assumed by the V and C regions. As 
explained earlier, all immunoglobulin folds are composed of two 
layers of antiparallel p sheet arranged in a sandwich (or p-barrel) 
that encloses a hydrophobic interior. Early on, it was recognized 
that each immunoglobulin domain contains seven polypeptide p 
strands, four of which comprise one P-pleated sheet, the other sheet 
consisting of the remaining three strands. Accordingly, a nomen- 
clature was developed that reflected within which P sheet (four- 
stranded or three-stranded) a particular strand was located, using 
numbering (3-1, 3-2, 3-3 and 4-1, 4-2, 4-3, 4-4). Subsequent stud- 



ies revealed that superimposed upon this secondary structural orga 
nization shared by all immunoglobulin folds was a discrepanc; 
between V and C region immunoglobulin folds; some of th 
"extra** amino acid residues found in variable regions actually par 
ticipated in p sheet formation, giving rise to two additional | 
strands. Consequently, the immunoglobulin folds of V domain 
actually form a barrel using five-stranded (analogous to the ( 
region three-strand sheet) and four-stranded p-pleated sheet layer? 
A second naming system using letter designations (A, B, C, C, C 
D, E, F, and G) for the different strands of immunoglobulin fold p 
pleated sheets makes allowance for these extra p strands. Figure 1- 
displays "unfolded" immunoglobulin folds of each type so as t« 
schematically represent the secondary structure of both V and ( 
immunoglobulin domains. Note that while the letter nomenclatur 
proceeds in accordance with the primary structure of the protei: 
(A, B, C, D, etc.), the numbering system of p strands is nonlinea 
with respect to the primary sequence (4-1, 4-2, 3-1, 4-4, 4-3, 3-^ 
3-3). Rather, this naming system is designed so as to coincide wit 
the three-dimensional orientation of the strands in the context o 
the fully folded immunoglobulin domain (refer back to Fig. 2" 
which will be discussed in greater detail below. 

Tertiary Structure — The Immunoglobulin Domain 

Logically, if the p sheets of the immunoglobulin fold are th 
dominant secondary structural protein motif of the antibody, the 
the fully-folded tertiary structural correlate of this paradigm is the 
of the immunoglobulin domain. As has been explained, Ig domain 
are founded upon the premise of two layers of P-pleated shee 
sandwiched around a core of hydrophobic side chains, to form 
compact globular structure. Of course, upon this general structure 
framework, Ig domains vary considerably. Still, certain feature 
common to all Ig domains of actual immunoglobulin molecules (a 
opposed to Ig domains of other IgSF members) are invariant. Cen 
tral to the Ig domain, both literally and figuratively, is the presenc 
of the two cysteines that form the intradomain disulfide bond an 
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FIG. 14. Schematic diagram of the secondary structure of "unfolded" immunoglobulin domains, p strands are repre- 
sented by horizontal lines, loops connecting p strands are depicted as vertical lines. Bold horizontal lines are in the 
four-stranded face of the domain, light horizontal lines are in the three- (five-) stranded p sheet. Dotted regions 
denote CDRs. Strands are numbered according to Edmundson and lettered {in parenthesis) according to Hood. In 
three dimensions, immunoglobulin domains are folded with the tight horizontal strands under the bold horizontal 
strands using the dotted line as an axis. 
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the tryptophan which protects this bond from hydrolysis. All 
domains of immunoglobulin — variable region or constant, heavy 
chain or light-conserve these three key residues, and they occupy 
homologous positions in the different domains of the fully-folded 
protein. 

Beyond these key core residues, however, different Ig domains 
are still able to maintain similar tertiary structures in the face of 
dramatic differences in their primary sequences. This chiefly 
derives from the fact that the identity of a particular residue at a 
particular position is not nearly so important to proper Ig domain 
folding as is the character of the particular residue at that posi- 
tion. In other words, as long as amino acids with side chains 
compatible with p-pleated sheet formation are present in the 
proper locations and those necessary to terminate p strands are 
similarly in the correct places, their actual identity appears not to 
be crucial. 

There are, of course, other residues essential to proper folding and 
functioning of Ig domains, but these are specific to particular 
domains and not common to all Ig domains within immunoglobulin. 
For example, the FR4 motif W7F-G-X-G is widely conserved, but 
only among V regions, where it serves to create a "p bulge" neces- 
sary for proper Vh/Vl dimerization. Similarly, the VH-specific (G-L- 
E-W-hydrophobic) and VL-specific (P-hydrophilic-hydrophobic-L- 
hydrophobic) FR2 sequence motifs — and their accompanying p 
bulges— are also conserved tertiary structures among their respec- 
tive subsets of Ig domains. Finally, as was discussed in secondary 
structure, V and C domains also differ in terms of their basic 
arrangement of the immunoglobulin fold such that V domains are 
composed of a four-strand (A, B, E, and D) and a five-strand (C'\ C\ 
C, F, and G) layer, whereas C domains consist of four-stranded (A, 
B, E, and D) and three-stranded (C, F, and G) p sheets. Clearly, this 
distinguishes variable and constant domains at the tertiary structural 
level as well. 

Once again, as with primary and secondary structural consider- 
ations, examination of tertiary structure can be facilitated by dis- 
tinguishing between V and C domains. In the case of variable 
regions, whether from heavy or light chains, obviously no single 
protein structure will ever fully suffice in describing the entire 
group. This is because each domain, Vh or Vl, is in effect a new 
structure, and unless solved crystallographically, can only be pos- 
tulated. Nonetheless, two broad generalities as pertains to V region 
tertiary structure are evident. First, the similarity between two dif- 
ferent V domain structures tends to closely parallel their related- 
ness at the genetic level - that is, one can reasonably predict that two 
different V H structures will be more similar to each other than to a 
Vl structure, two V H domains belonging to the same clan will be 
more similar than to one from a different clan, and so on. Color- 
plates 1 and 2, which compare molecular models of FR1 regions 
from antibodies belonging to the same and different clans, are par- 
ticularly compelling in this regard. While undoubtedly exceptions 
exist where two proteins are genetically similar but diverge at a few 
key residues with important structural consequences, in most 
instances this rule is valid for extrapolating the tertiary structures 
of unsolved V domains. In this context, it is perhaps also important 
to note that the FR1 region utilized to assign clan identity is solvent 
exposed and distal to the antigen-binding site, while the FR3 region 
which correlates best with family designation is immediately adja- 
cent to the binding site, and capable of affecting its conformation 
and even interacting with antigen directly (42). In light of several 
reports linking over-representation of certain families in the reper- 
toires reactive against particular antigenic specificities, the tertiary 



structural predictions available by this means perhaps take on 
added significance. 

The second pervasive trend which is apparent upon scrutiny of 
V region tertiary structures is the tendency of the antigen-binding 
site to represent a "nested gradient of antibody diversity" (42). 
Recall, investigations into the very first antibody proteins identi- 
fied nonconserved "hypervariable" regions and reasonably well- 
conserved "framework" regions, which were in turn postulated to 
correspond to the antigen-binding and structural foundations, 
respectively, of the molecule. These hypotheses proved correct, as 
the FRs were demonstrated to coincide with the P strands, and the 
CDRs were shown to derive from the variable loops that intercon- 
nect the strands. Moreover, current understanding of antibody 
structure makes it possible to recognize that, as a general rule, the 
most variable residues of an immunoglobulin V region localize 
immediately proximal to the antigen-binding site, whereas those 
that are most conserved tend to be distant to that site. Colorplate 
3 provides one such example of this concept. Thus, the three- 
dimensional context in which amino acids interact to create a plat- 
form for ligand-binding (paratope) diverges dramatically from 
antibody to antibody. 

There are several factors that influence the tertiary structure of 
the paratope itself, and their composite effect can be complex. 
First, sequence variation of two types in the loops can profoundly 
alter antigenic specificity and affinity. CDRs vary considerably in 
length, both as a function of V gene usage (affecting primarily 
CDRs 1 and 2) and as a consequence of junctional diversity (affect- 
ing only CDR3). Second, CDRs obviously differ significantly in 
terms of their sequence composition, due once again to gene usage 
and junctional diversity, and in addition to somatic hypermutation. 
In this way, diversity-generating mechanisms target amino acid 
variability to the CDR loops where they are most apt to change 
both the physical shape and chemical nature of the combining site. 
Also, because FR residues near CDR boundaries frequently inter- 
act with antigen directly (59), alterations in these positions affect 
structural variety as well. Even glycosylation of CDR asparagine 
residues has been implicated in changing loop conformation and 
antigen binding (60,61). Conformational variability can also play a 
critical role in diversifying the paratope surface, as CDR loops 
have been shown to interact extensively with adjacent FR amino 
acids and with each other (62,63). These studies, and failed 
attempts to engineer antibodies by simply swapping CDRs onto 
different FR backbones without appreciably affecting affinity for 
antigen, have further demonstrated that while V regions can be 
conveniently dissected into FRs and CDRs at the primary structural 
level, in actuality these elements cooperate to facilitate antigen- 
binding rather than acting as discrete elements. 

While the limitless capacity of the immune system to generate 
new variable region domains makes impossible the absolute eluci- 
dation of all potential V region structures, the relatively smaller 
number of constant regions has allowed reasonable progress to be 
made in terms of assigning definitive tertiary structures to the C 
region domains. To date, x-ray diffraction analysis has resulted in a 
high resolution structure for only the Fc fragment of IgGl (11). An 
a-carbon backbone of this structure is shown in Fig. 15A. Addi- 
tionally, one whole immunoglobulin (an unusual IgGl molecule 
with a hinge deletion) has been crystallized (64). Other Fc iso- 
types, based on the FC7I paradigm, can be modeled (see Fig. 15B) 
using sequence homology with IgG and energy minimization cal- 
culations (65). Otherwise, almost all three-dimensional studies 
have utilized Fab fragments or Fv fragments, often produced in 
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FIG. 15. Stereoviews of the a-carbon backbones of (A) Fcyl and (B) Fee. In both cases, the C-terminal domains 
dimerize, but the penultimate domains (0,2 or Cc3) contact their counterparts only via their carbohydrate moieties. 
The Fee structure is a prediction modeled on the lgG1 crystal. As IgG has no C E 2 domain equivalent, the C c 2 struc- 
ture is less reliable conjecture than is the rest of the molecule. (From refs. 12 and 65, with permission.) 



bacteria. Of note, the IgG Fc crystal has also been solved as a co- 
crystal with staphylococcal protein A (SPA) (12). Importantly, this 
structure reveals the binding of SPA between both the Cy2 and Cj3 
domains. This is contrary to the notion originally promulgated that 
single domains would perform IgG functions. Another example of 
sharing of function between domains is the Fca receptor binding 
site on human IgA, which also involves both C a 2 and C a 3 (66). 
Surprising results such as these demonstrate the need for continu- 
ing investigation — by both crystallographic and other means — into 
the structural intricacies of all of the immunoglobulin isotypes. In 



any case, in the intervening time, the general properties of 
topology, though proven only for IgG, are reasonably general^ 
to the other isotypes, since the residues involved in the various c( 
tacts are largely conserved. 

As for the V region domains, constant region domain structure 
governed by general principles that tend to hold true in most exa 
pies of C region domains thus far studied. The overriding c< 
sideration inthis regard, as for the V domains, concerns the re 
tive concentration of variability outside the p strands. C regie 
conserve a much higher percentage of residues from domain 
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COLORPLATE 1. Three-dimensional models of immunoglobulin FR1 regions. Space-filling representations of 
amino acids 6-24 are displayed with orientation such that the CDRs would be above, and the Ch1 domain below, 
the models. Amino acid residues 6, 9, 1 2, 1 3, 1 6, 1 9, and 23 are colored yellow for reference. In (A), antibodies deriv- 
ing from each of the three clans — clan I HyHEL-5 (/efl), clan II NEW {middle), and clan III KOL {right) — are presented.. 
Clearly these structures differ in their structural characteristics. In (B), three clan III antibodies — human Vh3 family 
KOL (/e/r), murine V H T1 5 family MCPC603 {middle), and murine VhX24 family J539 (right) — are compared. Note the 
similarity of these three FR1 loops relative to those in part (A). (From ref. 41, with permission.) 




COLORPLATE 2. Superposition of FR1 regions from antibodies of (A) different, and (B,C) the same, clans. Stick 
diagrams of amino acid residues and their respective side chains are overlayed to facilitate comparisons. In (A), clan 
I HyHEL-5 and clan II NEW are superimposed on clan III KOL. These molecules can be seen to vary significantly In 
(B), clan III antibodies MCPC603, J539, and KOL are modeled. Note that the agreement between structures even 
extends to side-chain sizes and orientations. In (C), clan I R19.9 antibody is superimposed on clan I HyHEL-5 with 
simitar results. (From ref. 41, with permission.) 
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COLORPLATE 3. The antigen-combining site is the product of a nested gradient of diversity. Antibody sequence 
variation is mapped on the template of the surface of the antibody POT using a scale of blue (highly conserved) to 
red (highly divergent). The V* domain is to the left of each model, and the V H domain is on the right The most highly 
variable CDR3 loop of the V H is depicted as a gray ribbon in the center of the diagrams; the highly variable V c CDR3 
is not shown in these representations. In (A) t germ-line diversity is displayed. In (B), diversity introduced by somatic 
hypermutation is presented. In (C), the sum of these two diversities is depicted. Note that in all cases, variation is 
predominantly restricted to the antigen-binding site, and diversity is highest in close proximity to the V H CDR3. In (D), 
residues that have been demonstrated to make direct side-chain contacts with antigen in 21 separate crystals are 
plotted on the same blue (zero contacts) to red (up to 16 contacts) scale. Thus, the presence of diversity and the ten- 
dency to contact antigen are intimately related. (From ref. 58a, with permission.) 



COLORPLATE 4. Ribbon diagram of the ct-carbon backbone of an immunoglobulin Fv fragment. The heavy chain is 
Sown in Wue and^e «ght chain is represnted in violet The invariant cysteines, tryptoph^^ 
atm^core of the V„ and V L domains are shown in yellow. Orientation of the Fv .s su<* ,nd,n9 S,t9 

is at the top and the ChI/Cu domains would lie at the bottom, of the plate. (From ref. 42, with perm.ss.on.) 




COLORPLATE 5. Model of the lysozyme-antilysozyme antibo 
(Fab D1.3) complex. The lysozyme molecule is depicted in gret 
and its gtutamine-1 21 residue in red. The D1 .3 heavy chain is sho* 
in blue and the light chain in yellow. In (A), the complex is seen 
it was in the crystal— as a blunt, end-to-end interface of the two m 
ecules In (B). the two proteins have been separated to demonstn 
their structural complementarity. In (C), the molecules have be 
rotated toward the viewer to allow visualization of important cont 
amino acids (now portrayed in red, Gln-121 in violet). (From i 
109a. with permission.) 
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COLORPLATE 6. Model of lysozyme and three antilysozyme com- 
plexes. The Fab D1 .3 (see Colorplate 5) is included for reference 
along with two additional antibodies: HyHEL-5 (/©/if) and HyHEL-10 
{below) Molecules are shown as a-carbon backbones except for col- 
ored van der Waals surfaces involved in binding. (From ref. 110 with 
permission.) 
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COLORPLATE 7. Ribbon diagrams of CD4 D1D2 (red), CD4 D3D4 
(green), and a CD8o/ot homodimer (one subunit ye//otv, the other blue). 
Note the continuous p strand that links domains D1 to 02 and D3 to D4 
in CD4. This causes the 01 02 and 03D4 segments to be rigid struc- 
turally. (From ref. 207, with permission.) 
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domain-across immunoglobulin class and even across species- 
than do individual V regions (refer back to Figs. 10-12), but even 
so stretches of relatively lower identity can be locahzed to distinct 
parts of the protein. Even though C domains are not formally delin- 
eated into framework and hypervariable regions the tenets used to 
classify the subdomains of V regions are still applicable when 
discussing C regions. In Fig. 10A, for example, which compares 
different human C„l domains, note the areas of not only lower con- 
servation but also where gapping is necessary to preserve align- 
ment. Without exception, these regions are most be ™*" 
strands, especially the loops connecting strands 4-1 and 4-2 (A and 
B) 4-2 and 3-1 (B and C), 3-1 and 4-4 (C and D), 4-4 and 4-3 (D 
and E), and 4-3 and 3-2 (E and F). Conversely, the areas of highest - 
conservation are found within the p strands (especially 4-1 4-2 
3-1 4-3 and 3-2), where, in addition to the residues needed for the 
intradomain disulfide linkage present in all Ig domains, the ammo 
acids necessary for main-chain folding, stabilization, and dimer- 
ization of the domain reside. 

Therefore, as was the case for V regions, the least conserved seg- 
ments of antibody C domains coincide with the loops that inter- 
connect the different P strands in the fully-folded protein In the 
case of C region domains, this refers to divergent loops .found at 
each end of the Ig domain, unlike V regions, where the CDRs are 
clustered at one end of the domain. Not surprisingly, these loops 
are also where the preponderance of functional interactions have 
been localized in binding studies. For instance, the binding site on 
IeG for the FcyRI receptor is located near the hinge region in just 
such a loop of the CJ. domain. Likewise, the previously mentioned 
binding of SPA with IgG Fc involves similar loops in both the 0,2 
and C3 regions (12). In another example, although no crystal 
structure is available for IgA, by modeling the IgA sequence on the 
IkG Fc three-dimensional structure, an exposed loop in the c a 3 
domain is predicted to be the binding site for the polymenc 
immunoglobulin receptor. •<*■„,.„„, r 

Thus, the areas of greatest divergence between different C 
domains are also those implicated in mediating the Afferent bio- 
logical effects that distinguish one class or subclass of antibody 
from another. It is fair to speculate that, like the pressures driving 
the evolution of diversity-generating mechanisms in V regions, 
similar forces have used the template of the C region Ig domain to 
select for a variety of distinct binding capabilities and functional 
attributes, and have utilized a similar region of the domain for these 
purposes. In a manner analogous to that seen for variable region 
tertiary structure, those parts of the C region domain that are most 
malleable structurally are the very same selected evolutionary for 
the acquiescence of new biological characteristics. 

Quaternary Structure— The Immunoglobulin 
Monomer 

Although, as stated previously, immunoglobulin domains are 
each capable of folding independently into stable tertiary globular 
structures, neither individual Ig domains alone nor entire heavy or 
light chains are ever encountered under normal physiological cir- 
cumstances. Rather, the simplest form of this protein that occurs 
naturally is that of the immunoglobulin monomer schemat.cal y 
depicted in Fig. 1. The complete "monomeric" antibody molecule 
is actually a four-chain dimer of a heterodimer covalently linked by 
multiple interchain disulfide bonds. In almost all cases, the heavy 
and light chains are joined by a single cystine to form a halt- 



monomer" with one complete antigen-binding site and one or 
more disulfide bonds in the hinge regions (or h.nge domains) link 
the two heavy chains to form the bivalent tetramer. 

Figure 16 represents in two dimensions a member of each of the 
five classes of human immunoglobulin. Notice that &e pnmary 
differences between these structures are the presence of the extra C 
domain in the IgM and IgE isotypes and the number and Pi*™ 1 
of disulfide linkages and carbohydrate derivatives among the dif- 
ferent molecules. Like the intrachain disulfide bond considered in 
tertiary structure, the interchain cystine attaching heavy and light 
chains is highly conserved. The cysteines part.c.pating in this bond 
are located at the N-terminal end of C„l and the C-terminal end of 
C, IrGI is an exception where the cysteine donated by the heavy 
chain is found at the carboxyl end of C„l (55). Another excepUon 
to the typical bonding pattern is found in the A2m(l) allotype of 
IgA2. Distinctively, this particular isotype utilizes H-H and L-L 
dWlf.de pairing to stabilize the four chains instead of the usual 
H-L linkage (67). The interchain cystines joining the heavy chains 
are more variable in both number and position between different 
isotypes. Generally, H-H bonds are formed between the h.nge 
regions or, in the cases of IgM and IgE, analogous pos.tions in Q.2 
or C E 2 domains, respectively. In addition, both IgM and IgA have 
two additional half-cystines that have dual roles, depending upon 
whether the antibody is incorporated into a polymenc form ot 
immunoglobulin. An extra cysteine near the C-terminus is involved 
in a disulfide bond occurring during J chain-mediated polymeriza- 
tion; another half-cystine in the C M 3 or C„2 domain forms an inter- 
subunit cystine in the case of IgM or IgA multimers. The disulfide 
bonds formed by these cysteines in the monomenc forms of IgM 
and IgA are either interchain (u. and a) or intrachain (a only>. 

Once interchain disulfide bonds have cemented the four chams 
into a complete immunoglobulin monomer, the quaternary struc- 
ture of these molecules can once again be best understood using a 
domain-by-domain analysis to examine the entire protein. Recall 
that at this structural level, individual domains interact in such a 
way that an antibody actually consists of a series of dimenc mod- 
ules (reviewed in ref. 27). These dimerized domains define the 
smallest structural and functional units of native immunoglobulin 
as demonstrated by proteolysis and x-ray diffraction studies. 
Accordingly, these modules will be rev.ewed in this context, 
beginning with the most ammo-terminal domain pan-, the Fv frag- 
ment — or Vh/Vl dimer. 

The Fv (see Colorplate 4) is the variable part of the Fab frag- 
ment, and as such, constitutes the minimal antigen-binding unit of 
an antibody. Likely due to this specific functional necessity, V 
domains dimerize in a manner unlike the strategy employed by all 
oLr immunoglobulin domains. In P-pleated 
amino acid side chains protrude on alternating sides at right angles 
to the plane of the sheet. In most proteins, one side of the shee 
packs against another part of the protein (i.e. it ,s 
Ld the other face of the sheet is exposed to solvent <*<*>P^>; 
fading to a sequence of alternating hydrophobic-hydrophil.c 
eSes. In immuVioglobulin domains that dimerize, however, this 
alternating pattern is broken by one of the domam's two P sheets^ 
The sheet Lt makes up the dimerizing face must interact w.th 
bom the other P sheet in its own domam and the dimenzation face 
of the adjacent domain. Thus, hydrophilic residues are replaced by 
hydrophobic side chains that support the 

the feature of breaking the alternating hydrophil.c.ty pattern .s 
common to both types of Ig domains that form dimers V domua. 
interact in a fashion not only specific to V„ and V u , but also— at 
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FIG. 16. Schematic representation of the five human immunoglobulin classes. Positions of disulfide bonds and gly- 
cosylation are shown for each antibody. (From ret. 66a, with permission.) 



the time it was first recognized — unique among all known protein 
structures (28). Unlike C domains, V domains utilize the five- 
stranded p-pleated sheet (recall these are unique to V domains) as 
a dimerization surface (68). The actual Vh/V l interface consists of 
the four strands C'-C-F-G on each V domain. 

This singular dimerization tactic has profound structural reper- 
cussions. In the section detailing tertiary structure, three different 
P bulges were cited: one common to all V regions found in strand 
G, one specific to V H domains located in strand C, and a third 
unique to V L also in strand C Note that all of these bulges occur 
in strands at the edges of their respective P sheets. Normally, p 
sheets pack such that residues in the middle strands form most of 



the contacts between layers. On account of these conserved P 
bulges, however, V domain edge residue side chains protrude into 
the dinner's interior, preventing a close association of Vh and V L . 
The loose packing of the Fv has the effect of creating a 
hydrophilic groove into which small molecules can fit. This 
groove, which is primarily lined by residues of the HVRs, and the 
remaining CDR loops at the end of the V regions form a potential 
antigen-binding site whose surface area is approximately 2000 to 
3000 A 2 (69). 

Many of the residues consistently responsible for Vh/Vl inter- 
domain contacts have been localized. About half of the hydropho- 
bic core contacts are formed between FR2 of one chain and FR4 of 
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the other. The majority of remaining interactions involve CDR3 of 
one chain and the FR2 and/or CDR3 of the complementary 
domain. Overall 12 to 21 V L and 16 to 22 V H residues participate 
in interchain stabilization (69). Given the extensive contribution of 
HVRs to these interactions (28), one might expect that the affinity 
of H-L pairing might also be variable. Nonetheless, a number of 
studies using heavy and light chains from different individuals, or 
even different species, have demonstrated the capacity of heterolo- 
gous H-L pairs to form stable associations. This implies that the 
conservation of the basic structural features of Vh and Vl domains 
has persisted throughout evolution, or at the very least not diverged 
to the point where the ChI and Cl elements are unable to anchor 
productive interactions between these disparate entities. 

Between the Fv and Fb (C H l/C L dimer) fragments, a short 
polypeptide stretch exists that is vital to both the Fv's ability to bind 
antigen productively and to the ability of Ch1/Cl domains — and all 
constant domains C-terminal to them — to dimerize properly. This 
region, comprising the carboxy-terminal amino acids of the V 
region contiguous with the N-terminal residues of the C region, 
connects Fv to Fb in the complete Fab fragment and is known as 
the elbow peptide. Collectively, the two elbows of an immunoglob- 
ulin Fab are also referred to as the switch peptide. Several Fab crys- 
tal structures have demonstrated the switch to be a flexible segment 
permitting considerable bending between the V and C domains 
(70). This is thought to be important in allowing Fabs to bind epi- 
topes of varying spatial arrangement. An equally important feature 
of the individual elbow peptides is the fact that they make possible 
a remarkable 180-degree rotation in the quaternary structure of 
antibody domains that is essential for the correct orientation of all 
C domains in order for dimer formation between them (8). 

These 1 80-degree rotations occur at the elbows between Vh-Ch1 
and Vl-Cl, and are necessary to properly position the C regions. 
The most N-terminal C domains, ChI and C L , are then able to com- 
bine to form the Fb fragment. C H 1 and C L are prototypes for the 
C type Ig domain. Like all C domains, they lack the C and C" 
strands present in the V domain five-stranded sheet. Like V 
domains, they also break the alternating hydrophilicity pattern in. 
one of their two p sheets, but in the case of these domains, this 
occurs on the four-stranded face of the immunoglobulin fold (A-B- 
E-D) instead. As a result — and due to the permissive rotations at 
the elbows — C H 1 and C L utilize the opposite (relative to V H and 
V L ) sides of their domains to dimerize (68). The less-polar residues 
at the core of these dimeric C modules are conserved in both ChI 
and C L across species (8-10,12), and in this case (unlike V 
domains), they tend to reside primarily in the middle strands (B and 
E), where they mediate a much "tighter" association. Conse- 
quently, the Fb is often perceived as a compact anchor for the V 
domains, forming a stable platform upon which antigen binding 
can occur. Fb, together with the hinge, also serves as a spacer 
between the Ag-combining site and the bulky Fc region. 

Between the Fab and Fc regions, immunoglobulin hinges (or 
extra domains) are critical determinants of overall antibody struc- 
tural and functional properties. Structurally, hinges are extended 
segments of dimeric peptide held together by one or more disulfide 
bonds and dominated by prolines, serines, and threonines. This 
amino acid composition gives hinges flexibility and gives the Fab 
arms of an antibody flexional (71) and torsional (72) mobility. 
Hinge flexibility allows the Fabs to conform to the arrangement of 
epitopes in order to bind bivalently — presumably giving an anti- 
body greater avidity and versatility. The degree of flexing permit- 
ted correlates strongly with hinge length between the end of ChI 



and the first interheavy disulfide bridge (the il upper hinge") 
(71,73); thus IgG3 is more flexible than IgGl. Hinge length and 
flexibility also reduce the steric barrier that Fabs may present to the 
access of Ch2 by other molecules. For example, while normal 
human IgGl activates complement effectively, the hinge-deleted 
variant IgGl paraprotein Meg is unable to fix complement be- 
cause the Fabs rest too close to the Clq-binding site on C H 2 to allow 
interaction. 

The hinge, being an extended peptide, is the most proteolytically 
labile part of an immunoglobulin; recall that the early studies that 
resulted in the understanding of Fab and Fc relied on proteolytic 
digestion of the hinge. For instance, the 5 isotype, with its long, 
charged hinge, is very susceptible to proteolysis, which may 
explain its short serum half-life (74). This issue is of critical impor- 
tance to IgA, which serves its primary function at mucosal surfaces 
where proteases from bacterial and host sources are prevalent. For 
example, the ccl hinge contains five carbohydrate attachment sites 
within a stretch of only 17 amino acids (75), rendering IgAl resis- 
tant to cleavage by intestinal proteolytic enzymes. However, several 
strains of bacteria secrete proteases that specifically target the al 
hinge. Presumably as an evolutionary consequence, the hinge of 
IgA2 has undergone a 1 3-amino acid deletion, restoring its resis- 
tance to this second form of proteolytic challenge (53,76,77). Sim- 
ilarly, structural features unique to hinges of each of the different 
isotypes may have evolved so as to confer their respective 
immunoglobulins with specific functional characteristics. 

C-terrninal to the hinge, the Fc' region resides. In the cases of 
IgG, IgA, and IgD, the Fc is a dimer of two Ch2-Ch3; in IgM and 
IgE it consists of paired C h 2--Ch3-Ch4 domains. Structurally, Cn/ e 3 
is equivalent to Cy2, and Qi/ e 4 is homologous to Cy3. As has been 
described above, the vast majority of sites that define the function 
and physiology of a particular isotype map to the Fc region. The 
First striking quaternary structural feature of the Fc is that Cy2 (and 
its structural homologues) fail to dimerize; Analysis of these 
regions demonstrates that Ch2 domains possess a hybrid structure 
intermediate between V and C domains. Whereas V domains are 
five- strand/ four- strand sandwiches (dimerizing on the five-strand 
face) and C domains are three-strand/four-strand sandwiches 
(dimerizing on the four-strand face), Cy2 is a four-strand/four- 
strand sandwich. Moreover, substitutions present in outward-point- 
ing side chains on both sides of the domain prevent dimerization 
along either face (11). Another mixed feature observed is that the 
P strands of Ofl are of lengths longer than those of V domains, yet 
shorter than those of either Ql or 0,3 domains. Finally, all Ch2 
domains are derivitized by an N-linked oligosaccharide near the 
middle of the domain (refer back to Fig. 1), except C a 2, where it is 
found nearer to the carboxy-terminal end of the domain. Hydrogen 
bonding between these sugars serves as the only contact between 
Ch2 domains. Moving C-terminally, another important distinction 
concerning Ch2 is that longitudinal contact with the Ch3 domain 
(about 340 A 2 surface area/domain) prevents little interdomain 
bending (12), unlike the flexible elbows between Fv/Fb and the 
hinge separating Fab and Fc. 

The final domains of the Fc, 0,3 and its structural equivalents, 
pair in the manner described for ChI and Cl. Studies using limited 
proteolysis, reduction, and denaturation originally designated this 
fragment, a dimer of C H 3 domains, as pFc'. C H 3 domains, like the 
Fb, dimerize with tight association between them (1100 
A 2 /domain), using the four-stranded faces of each domain (11). 
Also like C H 1 and Cu all C H 3-domain isotypes show conservation 
of core residues involved in this pairwise domain interaction. 
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Structurally, the only feature that makes a marked distinction 
between the different C H 3 domain homologues is the presence of 
the 18-amino acid "tail-piece" that exists at the carboxy-termini of 
the C a 3 and C H 4 domains. This sequence is important to polymer- 
ization and will be discussed further in the section on higher-order 
immunoglobulin structure. Taken together, the Fc can thus be 
thought of as an approximation of the Fab, having two pseudo-V 
regions (the Ch2s) at its amino-terminus, and a module of C- 
terminal constant regions (the Ch3s) dimerized in the mode char- 
acteristic of classic C-type domains. 

Although similar in many ways at the protein level, one of the 
properties that most notably differentiates the quaternary structures 
of the five classes of Fc is their pattern of glycosylation — with sig- 
nificant functional ramifications. For instance, while the oligosac- 
charide moiety of the IgG molecule accounts for only 2% to 3% of 
its mass, it has been shown to be essential for optimal activation of 
effector mechanisms leading to the clearance and destruction of 
pathogens (78-80). As introduced above, all human antibody mol- 
ecules of the IgG class have N-linked oligosaccharide attached at 
the amide side chain of Asn 297 on the P-4 bend (between p strands 
D and E) of the inner face of the Ch2 domain of the Fc region (8 1). 
This oligosaccharide moiety is of the complex biantennary type, 
having a hexasaccharide "core" structure (GlcNAc2Man3GlcNAc) 
and variable outer arm "noncore" sugar residues, such as fucose, 
bisecting N-acetylglucosamine, galactose, and sialic acid (see Fig. 
17). In all, a total of 36 structurally unique oligosaccharide chains 
may be attached at each Asn 297 residue. It is likely, but not cer- 
tain, that the precise fidelity of this glycosylation is important. 

The site for this Ch2 carbohydrate is a conserved feature for all 
mammalian IgGs, and glycosylation occurs at a homologous posi- 
tion in human IgM, IgD, and IgE molecules. As stated above, IgA 
is also glycosylated within its C a 2 domain, but at a site further 
C-terminal. Human IgM, IgA, IgE, and IgD molecules also bear 
additional N-linked oligosaccharide moieties attached to the C 
domains of their heavy chains. Furthermore, IgAl and IgD pro- 
teins also possess multiple O-linked sugars in their extended hinge 
regions, attached to the hydroxyl groups of serine and threonine 
residues. Glycosylation, in one form or another, is in fact charac- 
teristic of all heavy chain C regions and remains one of the most 
active areas of research in immunoglobulin structural biology. 

The structural and functional consequences of Fc oligosaccha- 
rides have begun to be assessed experimentally by comparison of 
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FIG. 17. Schematic representation of N-linked sugars attached to all 
Ch2 domains. The core carbohydrate moiety of the complex form of 
oligandosaccharides is represented by the sugar residues in open 
type. The possible outer-arm residues are in parenthesis. All possi- 
ble combinations are observed. SA, sialic acid; G, galactose; GN, 
N-acetylglucosamine; M, mannose; F f fucose. Attachment of oliosac- 
charide occurs on the amide side chain of the Asn-X-Ser/Thr sequon 
(X * Pro). The Ser/Thr residue forms hydrogen bond(s) with the 
amide group in order to activate it for attachment to the primary N- 
acetylglucosamine residue of the dolichol intermediate (catalyzed by 
oligosaccharyltransf erase). (From ret. 81a, with permission.) 



glycosylated and aglycosylated forms of IgG. The latter is ordinar- 
ily generated by growing IgG-producing E. coli in the presence of 
tunicamycin (a glycosylation inhibitor) or by protein engineering 
of the carbohydrate acceptor sequence. One characteristic appar- 
ently affected by the sugar moieties found on antibodies is the 
duration of these proteins* existence in vivo. Studies of the blood 
clearance of glycosylated and aglycosylated mouse/human 
chimeric IgG I molecules in mice reveal accelerated clearance for 
the aglycosylated form despite similar half-lives. Additionally, 
galactosylated and agalactosylated IgG have been investigated to 
determine the role of outer-arm sugars in complement (Clq)- 
mediated lysis. The agalactosylated form, produced following 
exposure to p-galactosidase, has an observed two fold lower activ- 
ity than the galactosylated form (82). Related studies have sub- 
stantiated these conclusions (83). 

The vital importance of correct glycosylation was further pro- 
vided by a study using a human/mouse chimeric IgG I molecule pro- 
duced in yeast cells, and anticipated to have high mannose-type 
oligosaccharide attached at Asn 297. The yeast IgGl product was 
unable to activate Clq to trigger human complement-mediated lysis 
of targets, while the same chimeric IgGl construct expressed in 
rodent cells (and therefore glycosylated normally) was effective in 
that regard. A direct role for oligosaccharide in activating the com- 
plement cascade is apparent with mannan-binding protein, a lectin 
that can function as a surrogate CI component The specificity of 
mannan-binding protein is for mannose and N-acetylglucosamine 
residues, and it has been shown that it can access and bind to termi- 
nal N-acetylglucosamine residues exposed on agalactosyl IgG (84). 

Studies utilizing the three types of human Fey receptors (FcyRI, 
FcyRII, and FcyRIII) have also attested to the significance of 
oligosaccharide modifications on antibodies. The IgG subclass 
specificity of the FcyR suggests that recognition is correlated with 
the presence or absence of carbohydrate derivatives. This conclu- 
sion is supported by the demonstration that aglycosylated human 
chimeric IgG3 has a reduced interaction with all three Fey recep- 
tors. Moreover, at the level of function, while haptenated RBCs 
sensitized with this same aglycosylated IgG3 antibody could still 
trigger superoxide production by U937 cells, higher levels of sen- 
sitization were required compared to normally glycosylated IgG3. 
The aglycosylated IgG3 also was not recognized by human FC7RII 
expressed on K562 and Daudi cells, had reduced rosette formation 
(mediated through FcyRII expressed on human NK cells), and 
essentially abolished antibody-dependent cellular cytotoxicity 
(85,86). Clearly these conclusions illustrate that proper Fc glyco- 
sylation is — at least in some cases — necessary for normal struc- 
tural recognition and biologic function of immunoglobulins. 

Glycosylation is potentially important outside of the Fc region as 
well. It has been estimated that up to 30% of polyclonal IgG mole- 
cules are also derivitized by oligosaccharide within the Fab region. 
Since there are no known sites for sugar attachment in Cjl or Cl, 
this is most likely in the V regions. Of interest in this regard, an 
analysis of the DNA sequences of 83 functional human germline V H 
gene segments revealed five that encoded potential glycosylation 
sites. Some, but not all, of these are known to be glycolyslated. In 
one study of protein and cDNA Vh and V L sequences, about 25% 
had potential glycosylation sequences, some of which had arisen as 
a result of somatic mutation and antigenic selection (87). In most 
circumstances of V region glycosylation studied thus far, the 
oligosaccharide moiety does not contribute directly to ligand bind- 
ing, but can exert a subtle influence on protein tertiary and quater- 
nary structure that is essential for full activity of the antibody. Thus, 



Structure and Fun 



in of Immunoglobulins 



/ 57 



oligosaccharides occur in many places on immunoglobulin mole- 
cules and can affect antibody characteristics as disparate as antigen- 
binding and the assortment of different Fc-associated functions. 

Higher-Order Immunoglobulin Structure — 
Polymeric Immunoglobulin 

One of the most fascinating structural attributes of immunoglob- 
ulin is the ability of two classes of heavy chain, IgM and IgA, to 
form higher order multimeric complexes. IgM and IgA antibodies 
do not always form polymers, however, monomelic a and p. iso- 
rypes exist in forms analogous to those for the y, 8, and e isotypes, 
as well. In addition, polymeric immunoglobulin (pig) can typically 
come in a variety of manifestations. The most common forms of 
these molecules are dimeric (IgA) and pentameric (IgM), although 
other polymers have also been described. Electron micrographs of 
murine pentameric and hexameric IgM and human dimeric and 
trimeric IgA molecules are presented in Fig. 18. Multimenzation 
obviously increases the number of potential antigen-binding sites 
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FIG 18 Electron micrographs of immunoglobulin multimers. In (A), 
a murine IgM pentamer and interpretive diagram {top) and a murine 
IgM hexamer and diagram {bottom) are displayed. In (B), a human 
IgA dimer {top) and trimer {bottom) are presented. All magnificaUons 
are x60O,000. (From ref. 87a, with permission, and courtesy ot K. h. 
Roux.) 



available on the antibody, and this increase in valence translates into 
enhanced avidity for polymeric, low-affinity epitopes. This is par- 
ticularly beneficial for antibodies of the n and a classes, which 
serve as the first line of defense at mucosal surfaces where 
encounter with this type of pathogenic target (i.e., cell-wall poly- 
saccharides) is most frequent. Moreover, IgM, which is the antibody 
characteristic of primary humoral responses when affinity matura- 
tion has not yet occurred, is reliant upon this increased avidity to 
mediate its functional responsibilities. In addition to raising the 
apparent affinity for antigen binding, polymeric IgM* juxtaposition 
of several Fc regions in close proximity also likely plays a role in its 
efficacy in fixing complement components of the classical pathway. 
Mechanistically, assembly and secretion of pig involves the covalent 
linkage of concatomers of prototypic immunoglobulin monomers, 
and two accessory proteins termed J (joining) chain and secretory 
component (SC) play key roles in these processes. 

J chain is a 137-amino acid polypeptide synthesized by pig-pro- 
ducing plasma cells. J chain covalently interacts with one or more 
cysteines of immunoglobulin monomers undergoing multimenza- 
tion (88) It is a proteolytically labile molecule with a high content 
of negatively charged amino acids and has eight cysteine residues 
involved in both intra- and interchain disulfide bonds (89). The 
high level of conservation between J chains of human (90), murine 
(91), rabbit (92), and even amphibian (93) origin implies that there 
is a powerful selective pressure to maintain J chain structure. A 
report identifying J chains in invertebrates which have no known 
correlate to antibody proteins (94) also indicates that the J chain 
probably performs some basic protein function that predates its 
eventual development of the ability to interact with immunoglobu- 
lin In any event, structural studies imply that despite the fact that 
J chain lacks any significant sequence homology with immuno- 
globulin it likely folds into a p-barrel structure similar to that of an 
immunoglobulin fold (95). Besides the intrachain cystines which 
stabilize the J chain itself, additional cysteine residues form disul- 
fide bridges to the tailpiece of one or more immunoglobulin 
monomers during multimer assembly (89). Although the actual 
details of polymerization have not as yet been elucidated, it is 
known that the 18-amino acid C&a tailpiece and its penultimate 
cysteine residue are necessary for the process (52,96). 

The stoichiometry of multimer assembly is such that one J chain 
is present per polymer (whether dimer, trimer, pentamer, etc.). 
While not always a part of plgM molecules, J chain is probably 
absolutely necessary for formation of polymeric IgA, as it is always 
present in the complex. J chain synthesis is known to be highly reg- 
ulated (97), and it is thought to be linked to the B cell* activation 
state as well (98). High levels of J chain expression have been 
shown to result in production of normal J chain-associated pen- 
tameric IgM, while low J chain synthesis results in secretion of 
hexameric IgM lacking the protein. Intriguingly, this hexameric 
IgM is actually 20-fold more potent at promoting lysis by comple- 
ment than is the usual pentamer (99). 

The second accessory molecule associated with the secretion of 
multimeric antibodies actually derives from another protein 
belonging to the IgSF-the pig receptor (pI g R)-and is not even 
made by cells of the B lineage. Secretory component (also called 
secretory piece) was initially discovered as a polypeptide found 
tightly complexed to the Fc of secreted forms of IgA and IgM 
(100); subsequently, it was recognized that SC is actually a portion 
of the larger transmembrane plgR protein (101). The entire cDNA 
sequence of the plgR reveals a protein consisting of seven 
domains- the first five are extracellular and structurally similar to 
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immunoglobulin V regions, the sixth contains a transmembrane 
segment and is partially homologous to immunoglobulin V 
domains, and the seventh contains an unrelated C-terminal intra- 
cellular domain (102). The first five domains of the plgR are in fact 
the secretory piece originally co-isolated as part of the secreted 
immunoglobulin complex. 

The plgR is synthesized in epithelial cells of the respiratory, 
gastrointestinal, and genitourinary tracts and is expressed on their 
basolateral aspect, where it binds to plgA and plgM in a high- 
affinity interaction. It is known that the N-terminal domain of the 
plgR confers binding specificity, and it is thought that both J chain 
and Fc C 0 3/C,i4 determinants are recognized by the receptor. 
Interestingly, although the precise molecular locations of plg/SC 
interaction have not been identified, there is evidence that, at least 
structurally, the site is well conserved. Studies have shown, for 
instance, that human SC binds not only human plgA and plgM 
(103,104), but also several other mammalian species' IgM and IgA 
(105), and even chicken IgA (106). This cross-reactivity, however, 
may be mediated by the J chain rather than the Fc regions. 

Regardless, following the initial interaction between domain 1 of 
the plgR and the C-terminal domain/J chain of the pig, secondary 
contact occurs between plgR domains 3, 4, and 5 with the antibody, 
consumated by formation of a disulfide linkage between the SC and 
C^JCyd. This covalent bond is between Cys 467 in domain 5 of the 
plgR and Cys 311 located in the C a 2 domain of one IgA subunit's 
heavy chain (an IgA dimer would have four C«2 domains overall) 
(107). After a stable interaction has been established, endocytosis of 
the complex occurs via clathrin-coated pits. Next, following cleavage 
between domains 5 and 6 of the plgR, the poly-Ig /SC (now formally 
termed secretory immunoglobulin) is exocytosed at the apex of the 
cell, releasing the secretory immunoglobulin onto the mucosal sur- 
face (108). Current thinking holds that SC's most important func- 
tion, outside the realm of its role as the pig-binding portion of the 
plgR, is to help protect secretory immunoglobulin in harsh 
mucosal environments. 

Structurally then, polymeric antibodies represent the pinnacle 
of complexity in terms of immunoglobulin's expansion upon the 
Ig homology domain concept. From the fundamentals of a simple 
1 10-amino acid domain with a few conserved core residues and a 
basic structural topology, an intricate molecule such as pen- 
tameric IgM (containing 70 different Ig domains of both V and C 
types, not to mention Ig domains contributed by SC!) is con- 
structed. A molecule capable of recognizing as many as ten 
(although steric constraints usually dictate less) identical specific 
antigenic determinants, and also able to mediate several different 
important biologic functions — all of which will be detailed in the 
following section. 



IMMUNOGLOBULIN FUNCTION 

Throughout this chapter thus far, many of the differing func- 
tional capacities of antibodies have already been alluded to, as per- 
tains to the identification of the specific structural determinants 
responsible for particular interactions. Still, the plethora of biologic 
activities performed by immunoglobulin is best treated as a 
separate section, in which the many and varied aspects of 
immunoglobulin function can be detailed and integrated in a phys- 
iologic context. Collectively, secreted antibodies are able to acti- 
vate both the classical and alternative complement cascades (see 
Chapter 29), transcytose across epithelial cell layers to provide a 



barrier to pathogens at mucosal surfaces (see Chapter 27), travel 
transplacental^ to confer maternal humoral immunity to the fetus 
and neonate, induce phagocytosis by macrophages and granulo- 
cytes via the process of opsonization (see Chapters 30 and 41), fos- 
ter antibody-dependent cellular cytotoxicity by lymphocytes and 
NK cells (see Chapters 17 and 31), encourage antiparasite immune 
responses by eosinophils (see Chapter 38), and promote degranu- 
lation by mast cells and basophils (see Chapters 32 and 35) — not to 
mention antibody's ability to bind and inactivate foreign antigenic 
entities directly (see Chapter 39)! 

Even this imposing list of attributes neglects to mention the myr- 
iad effects mediated by surface immunoglobulin that include, but 
are not limited to, the induction(s) of activation (see Chapter 7), 
differentiation (see Chapter 6), anergy (see Chapter 20), and even 
apotosis (see Chapter 23) of B lymphocytes, which are detailed 
elsewhere in this volume. Surface Ig on memory B cells also has 
the ability to act as a high-affinity receptor for the recognition, 
internalization, degradation, and eventual presentation of specific 
antigens to T cells (see Chapter 9). This allows memory B cells to 
act as antigen-specific antigen-presenting cells (APC), which 
makes them uniquely efficient among this class of cells. Moreover, 
emerging fields of study, such as the growing body of literature 
concerning intracellular antibodies, indicate that new functional 
capacities for immunoglobulin are likely yet to be discovered. 

As for the preceding sections on immunoglobulin structure, the 
biologic capabilities of immunoglobulin are best treated by dis- 
secting the molecule into the portions responsible for each of its 
different functional characteristics. While there are some excep- 
tions, in general, specific functions of antibodies can be ascribed to 
individual domains of the molecule. In the case of variable regions, 
this requires consideration of the two V domains (Vh and Vl) 
whose primary function is the binding of antigen. Additionally, it 
has also been appreciated that certain "superantigens" bind to the 
V domains as well. In the case of constant regions, because no 
effector properties have been linked to Cl domains, this entails dis- 
cussion of each of the heavy chain isotypes (IgM, IgD, IgG, IgA, 
and IgE), whose functional differences must be a direct result of 
their structural heterogeneity. 



Variable Region Functions 

The two V regions (either V H /V\ or V H /V K ) together form the 
variable domains of an antibody molecule and provide the speci- 
ficity for targeting the effector arms of immune response. In gen- 
eral, both V regions are needed to provide specificity and high 
affinity. There are many examples of individual variable regions 
binding antigen, but clearly, when the two chains act in concert, the 
exquisite specificity and affinity of interaction between antibody 
and foreign antogen is dramatically enhanced. 

The concept of hypervariable regions and complementarity- 
determining regions is pertinent here. In general, most of the con- 
tacts between the V domains and antigen take place between amino 
acid residues in the CDRs and the major epitopes on the antigen 
(109). However, more recent studies have documented considerable 
contact between so-called framework residues and antigen. This is 
most dramatically seen in the lysozyme-antibody crystal (see Col- 
orplate 6) where many residues (especially in the heavy chain FR3 
region) are in direct contact with the antigen (110). The generaliza- 
tion that the CDRs provide all of the contact residues grew out of 
the early work involving hapten/anti-hapten systems. When only 
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small organic haptens are the "antigen" then the CDRs can easily 
provide a "pocket" in which antigen engages antibody (recall that 
the peculiar dimerization strategy employed by V domains has the 
propensity to generate such pockets). Closer inspection of the 
lysozyme Ag-Ab complex — in particular, lysozyme residue Gin 
121 — is instructive in this regard. Gin 121 protrudes into the cleft 
between V H and V L much like haptens Fit into the groove described 
above. However, other non-groove residues still appear to provide 
the bulk of the interacting surface for this antibody. In fact, other 
antibodies that are also reactive with lysozyme (see Colorplate 6) 
have further borne out this face-to-face binding concept (59). It is 
fair to say, then, that generally when large molecules such as pro- 
teins complex with antibody, the interaction is one of two protein 
"faces'* coming together. In that instance, the notion of a pocket is 
less appropriate, and as such, non-CDR residues (especially in FR1 
and FR3) are also frequently involved. 

As with all molecular associations, antigen-antibody interac- 
tions occur only if the binding reaction releases enough free energy 
to be thermodynamically favored. The affinity of interaction is 
exponentially related to changes in free energy (see Chapter 4). 
Free energy changes are the sum of changes in both entropy and 
enthalpy, with increases in entropy and decreases in enthalpy favor- 
ing binding. Few association reactions are able to fulfill both of 
these requirements, however. Instead, a favorable change in one 
component compensates for a less unfavorable change in the other. 
When "antibodies bind their ligands, the freedom of one molecule 
to move relative to the other is lost (an unfavorable decrease in 
entropy). Stabilization of most conformational motions of both the 
epitope and the backbone and side chains of the paratope surface 
lowers entropy even further. Thus, to encourage binding to antigen, 
antibodies must attempt to limit decreases in entropy and offset 
these losses by potentiating decreases in enthalpy. At the amino 
acid level, this leads to a selection for Tyr, Trp, Ser, and Asn in 
combining sites (111,112), because these residues have lower con- 
formational freedom, and hence less entropy to lose upon binding. 
Additionally, the side chains of these residues foster the varied 
chemical interactions that drive changes in enthalpy necessary to 
promote binding energetically. 

Specifically, the antigen-antibody interaction involves a variety 
of forces, including electrostatic (the attraction between opposite 
charges), hydrogen bonds (hydrogen shared between electronega- 
tive atoms), van der Waals forces (the fluctuations in electron clouds 
around molecules oppositely polarize neighboring atoms), and 
hydrophobic forces (hydrophobic groups interact unfavorably with 
water and tend to pack together to exclude water molecules) (113). 
Of course, salt bridges and other forms of interaction also come into 
play in some specific immunoglobulin-ligand complexes as well. It 
is also important to appreciate that rarely do covalent bonds occur 
between antigen and antibody. Thus, antigen-antibody complexes 
are readily dissociated by solvents that break the above bonds, such 
as high salt, organic solvents, urea, and so on. 

Thermodynamic considerations for ligand binding are favored 
by large interacting surfaces of both antibody and antigen, which 
are packed as closely as possible. Large interaction areas exclude 
more bound water, somewhat opposing losses in protein entropies 
with gains in solvent entropy. Surprisingly then, some antigen- 
antibody complexes actually retain water molecules in their inter- 
faces. However, rather than interfering with binding, these fre- 
quently contribute to the interaction by hydrogen bonding to both 
surfaces (114). More important than entropic changes, the overrid- 
ing impetus for large contact areas of antibody and antigen is their 



ability to bring about large decreases in enthalpy. This effect is 
mediated by allowing many chemical interactions of all kinds to 
occur simultaneously between the epitope and paratope. 

A model for interaction outside the realm of typical antibody- 
antigen binding has recently come into vogue, that of the "super- 
antigen" Superantigens were first appreciated in the context of T 
cell activation. Certain molecules (particularly bacterial products) 
were found to interact with many different T cell receptors (TCRs) 
having a variety of specificities (115). Thus, superantigens were 
originally defined as intact proteins that stimulated large numbers 
of T cells by binding the V region of a specific family of Vp chains 
(the heavy chain of the TCR) outside its normal binding groove. 
Typical T cell superantigens simultaneously stimulate 5% to 25% 
of the T lymphocyte population, compared with 0.01% stimulation 
of T cells by a conventional antigen. 

More recently, this concept has been extended to B cells 
(116,117). SPA is a prototype of a B cell superantigen. Although 
SPA was known to bind to certain immunoglobulin C regions — it 
has long been used as a mitogen for human B cells — it has been 
shown that SPA also binds to certain human Vh3 -encoded antibod- 
ies (118,119). It also binds to the Fab region of murine immuno- 
globulins, particularly those of the J606 and SI 07 V H gene families 
(which belong to the same clan as human V H 3). SPA binds inde- 
pendently of D, Jh, and light chain utilization, although some light 
chains influence the extent of binding. Since the interaction is inde- 
pendent of the specificity of the antibody, and since SPA in general 
does not block antigen binding, it is considered a B cell superanti- 
gen. SPA is even able to deliver activation signals to stimulate the 
differentiation of those B cells containing V H 3-encoded receptors, 
and SPA also stimulates antibody production. More recent work 
(120) has documented that SPA simultaneously interacts with FR1, 
CDR2, and FR3 on the V H region (Fig. 19). 

Other superantigens have also been described that are able to 
bind immunoglobulin in regions apart from the traditional antigen- 
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FIG. 19. Ribbon drawing of the Fv fragment of the V H 3 antibody 
KOL. The FR1 , CDR2, and FR3 subdomains of the heavy chain (/eff) 
are juxtaposed in a manner forming a solvent-exposed face which 
allows SPA binding. (From ref. 120, with permission.) 
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combining site (based on their broad specificities). These include 
the HIV envelope protein gp 120 (121), which like SPA binds V H 3- 
encoded antibodies, and the TCR-associated molecule CD4 (122). 
Like the circumstance with antigen binding, superantigens gener- 
ally require both V H and V L domains (even though the particular 
identity of the light chain is unimportant). Individual heavy chains 
do not bind the B cell superantigens that have been described to 
date, indicating that light chains must influence their conforma- 
tions appreciably Finally, the report of a crystal structure (an IgM 
rheumatoid factor Fab complexed to its autoantigen, an IgG Fc) 
showing residues at the edge of the conventional binding site medi- 
ating interaction indicates that still more novel paradigms for anti- 
body-antigen binding possibly exist as well (1 23). 



Constant Region Functions 

Because mammalian species each utilize the same major classes 
of antibody (although their organization of subclasses differs), it is 
reasonable to presume that each isotype subserves some vital bio- 
logic function(s). Along these same lines, it should be remarked 
that even in "lower" species, where only one type or one copy of 
heavy chain gene is present, the protein product resulting from this 
element is typically heterogeneous. In other words, although fish, 
amphibians, and reptiles all possess fewer immunoglobulin iso- 
types than do mammals at the genomic level, greater than one C 
region protein is produced per gene. For example, sharks make 
both monomeric and polymeric IgM; in skate, turtle, and duck 
there are truncated and full-length versions of the immunoglobulin 
polypeptide; Xenopus immunoglobulin comes in both glycosylated 
and aglycosylated forms (51). Clearly, evolution has recurrently 
employed the strategy of adopting more than one type of antibody 



to perform the multitude of biologic responsibilities that are 
required by species for effective immunologic functioning. 

In a broad sense, Fc-mediated effector functions can be classi- 
fied into three general categories: (a) activation of complement, (b) 
interaction with effector cells, and (c) transport and compartmen- 
talization of immunoglobulins. In addition, different isotypes have 
different stabilities in vivo, such that this is an important variable as 
well. In the following sections, the five classes of human 
immunoglobulins are each discussed separately with respect to 
function. Table 2 presents a summary of key properties for each 
class of human antibody, and Fig. 20 compares the circulating 
serum levels for each of the five major isotypes. 



IgM 

IgM is the most versatile antibody and almost certainly the first 
type of immunoglobulin to have developed evolutionarily. Heavy 
chains of the u, class are the first type expressed during B cell 
development, and IgM is the isotype produced in primary immune 
responses. IgM, in the form of surface immunoglobulin, is also an 
important receptor on immature B lymphocytes and on mature, 
naive B lymphocytes. Total serum Ig consists of 5% to 10% IgM, 
and second to IgA it is the major isotype of mucosal immunity. 
Originally named due to their description as ^microglobulins, IgM 
molecules are thought to serve similar functions in all mammalian 
species. In fact, IgM-like (polymeric, having five domain heavy 
chains with large carbohydrate content, and present as a cell sur- 
face receptor on most B cells) antibodies have even been identified 
in most non-mammalian vertebrates other than the jawless fish 
(51). Unquestionably, the polymeric structure of IgM has been con- 
served in evolution, probably due to its higher avidity for antigen 
compared with that of the monomer. 



TABLE 2. Physical, 



chemical, and biological properties of human heavy chain immunoglobulin classes 

IgA 



Property 



IgM 



igD 



IgG 



Molecular form 

Number of C region domains 

Tailpiece 

Accessory chains 
Subclasses 
Molecular weight 
Carbohydrate content (%) 
Percentage of total serum Ig 
Average adult free serum level 

(mg/ml) 
Synthesis rate (mg/kg/d) 
Serum half-life (d) 
Antibody valence 
Bacterial lysis 
Placental transfer 
Mast cell/basophil binding 
Macrophage binding 
Classical complement activation 
Alternate complement activation 
Other biological properties 



Pentamer, hexamer 
4 
+ 

J chain, SC 
None 
950 kD, 1150 kD 
10 
5-10% 
0.7-1 .7 

7 
5 

10, 12 
+ 



Primary Ab 
responses; 
Secretory 
immunoglobulin 



Monomer 
3 

None 
None 
175 kD 
9 

0.3% 
0.04 

0.4 
3 
2 
? 



Unknown; Useful 
as a B cell 
marker 



Monomer 
3 

None 
G1,G2,G3.G4 
150 kD 
3 

75-85% 
9.5-12.5 

33 
23 

2 

+ 

+ 

+ 
+ 
+ 

Hallmark of 
secondary 
immune 
responses 



Monomer, dimer 
3 
+ 

J chain, SC 
A1.A2 
160 kD, 400 kD 
7 

7-15% 
1.5-2.6 

65 
6 
2,4 



, A1+.A2- 
Main secretory 
immunoglobulin 



Monomer 
4 

None 
None 
190 kD 

13 
0.02% 
0.0003 

0.016 
2.5 
2 

9 



Allergic and antt- 

parasite 

responses 
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FIG 20. Circulating levels of different human immunoglobulin iso- 
tvpes Note the log scale of the graph and that both human IgA iso- 
types are represented. (From ref. 123a, with permission.). 



The two most common forms of IgM are the membrane-bound 
monomeric form and the secreted pentamer. The cell-surface ver- 
sion of IgM serves as the antigen-specific receptor for B cell acti- 
vation, although the activation signal is actually transmitted by the 
transmembrane accessory molecules Iga and Igp (124). It is 
unclear whether the Ql4 domains of surface IgM participate in the 
interaction with the a/p heterodimer, but it is more likely that the 
important associations lie in the transmembrane and cytoplasmic 
regions of IgM that are specific to the cell-surface form (125). The 
surface form of IgM is also important in the development of the B 
cell. During pre-B stages, p. heavy chains are associated (via a 
disulfide linkage in C u l) with the "surrogate" light chains Vp«B 
(analogous to a V L domain) and X5 (a Cx analogue) (126,127). This 
complex, once again through accessory molecules, is able to trans- 
duce signals thought to be necessary for allelic exclusion of the 
other heavy chain locus and for subsequent light chain rearrang- 
ment (detailed in Chapter 5). Eventually, of course, the IgM heavy 
chains become associated with either X or K light chains. 

Polymeric IgM also has its own catalogue of functional attributes. 
IgM antibodies are the first to be secreted from plasma cells upon 
challenge by antigen; since IgM is not secreted in large quantities 
from memory B cells, elevated IgM is indicative of recent antigenic 
exposure. As stated earlier, IgM antibodies generally have low affin- 
ity, as they have not gone through the processes of somatic hyper- 
mutation and affinity selection. Nonetheless, the high avidity of 
plgM renders it capable of efficiently binding antigen. Similarly, a 
single polymeric IgM molecule is able to effectively initiate classical 
complement fixation, even though the affinity of Clq for Cp is very 
low. The Clq-binding site of IgM has been localized to the Q3 
domain (128) and appears to be dependent upon carbohydrate found 
there for potent binding (129). While Cfi domains (and their struc- 
tural homologues) are not well conserved evolutionarily, the ability 



to mix IgM and complement from different species and retain activ- 
ity indicates that the complement recognition sites on vertebrate 
immunoglobulins may be similar. In fact, even Xenopus IgM has 
been demonstrated to fix mammalian complement components 
(130)i IgM has also been shown to interact with C3b via its C M 1 
domain, thereby allowing antibody-antigen complexes containing 
IgM to indirectly mediate phagocytosis. By this mechanism, C3b, 
once fixed, can promote uptake via complement receptors found on 
macrophages. 

The high avidity of IgM for both antigen and complement is cru- 
cial in the context of its role as a front-line defense mechanism. IgM 
not only is the humoral agent of primary immune responses, but 
also— like IgA — is transported by the plgR across epitheha such that 
it serves a rote as a secretory immunoglobulin at mucosal surfaces. 
Since secretory immunoglobulins are present in breast milk as well, 
IgM also participates significantly in protecting the newborn from 
intestinal pathogens until such time as the neonatal immune system 
is fully functioning. A role for IgM in mucosal immunity must have 
developed early in evolution, as it is the sole immunoglobulin in 
some animals. 



IgD 

IgD is present in serum in very low amounts (less than 0.5% of 
total serum Ig). Although synthesis of IgD is also very low (at least 
an order of magnitude lower than that of IgM, IgG, and IgA), IgD s 
pronounced susceptibility to proteolytic degradation is probably 
also responsible for its scarcity in plasma and other bodily fluids. 
The unusually long hinge region linking Fab to Fc in IgD is thought 
to be largely accountable for its short half-life. IgD is secreted nei- 
ther during an immune response, nor following mitogenic stimula- 
tion of IgD+ B cells, although in the form of immune complexes it 
is known to be able to activate the alternative complement cascade. 
IgD's low levels make this complement fixation unlikely to be 
important in the in vivo context. In fact, no specific functions 
unique to IgD have been definitively assigned to the 5 Fc region in 
either its membrane-bound or soluble forms. That notwithstanding, 
the fact that the IgD class is maintained in all mammals, has a high 
level of conservation across species (131), and the existence of an 
Fc5 receptor, all suggest that it may have some distinct purpose. 
Still, two independently derived strains of IgD knockout mice have 
failed to ascribe to it a convincing immunologic role. In one strain, 
in which a premature stop codon was introduced into the C83 
domain, a subtle reduction in the total number of peripheral B cells 
was noted (132). In the other, which carried an insertion in its C s l 
exon and a frameshift in Q>3, delayed affinity maturation during T 
cell-dependent antigen responses was demonstrated (133). 

Although not known to have any unique function, IgD, together 
with IgM is a major surface component on many B cells. Because 
the C region genes for p and 5 are both transcribed in the same pri- 
mary RNA message, differential splicing to produce either IgM or 
IeD is required. This particular genomic organization facilitates 
their coexpression, which is not possible for any other isotypes 
(reviewed in Chapter 5). Mature, naive B cells migrate torn the 
bone marrow as IgM+/IgD+ cells (134) and make up about 90 A of 
peripheral B cells in both the murine and human systems (see 
Chapter 6). Similarly, B cells in the primary follicles of secondary 
lymphoid organs coexpress IgM and IgD, but as they mature to 
memory cells, IgD expression is typically lost (135,136). Curi- 
ously, studies of IgM+/IgD+ splenic B cells reflect that IgD surface 
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expression is actually tenfold higher than IgM levels (137). This is 
particularly puzzling, given that 5 message levels are lower than are 
\i mRNAs, and IgD (at least in the serum) is known to be so prote- 
olytically labile. Perhaps helping to explain this high level of IgD 
expression is that fact that IgD does not need to complex with other 
proteins for transport to the cell surface, distinguishing it from all 
other immunoglobulin classes (138). It is possible that IgD's high 
levels of surface expression and intrinsic flexibility (139) afford it 
a role in the early response to antigen (123a, 1 40). 

In addition to their coexpression, IgM and IgD have a number of 
commonalties in terms of their function as B cell antigen receptors. 
Like IgM, IgD is non-covalently associated with Iga/IgP het- 
erodimers, which serve as the signaling component of their BCR 
(138). Not surprisingly then, ligation of either IgM or IgD by anti- 
gen can independently mediate activation, deletion, or anergy of B 
cells (141), and likewise, the signals propagated by IgM or IgD 
BCR seem to be the same, albeit with different kinetics (see Chap- 
ter 7). Specifically, signals transmitted through surface IgD have 
been reported to cause induction of APC function (142); upregula- 
tion of coreceptors B7-1 and B7-2 (143); class switching to IgM, 
IgGl, IgG2, IgG3, and IgA (144); and increased secretion of IgE 
(145). The biologic significance of many of these findings remains 
unclear. In fact, reports (146,147) describing a new class of germi- 
nal center IgD+ B cells (having evidence of up to 80 different 
somatic mutations within their V regions!) demonstrate just how 
little is still understood about the cells expressing — and the pro- 
tein— IgD. 

IgG 

IgG is the predominant immunoglobulin in blood, lymph, peri- 
toneal fluid, and cerebrospinal fluid. Collectively, it makes up more 
than 75% of serum immunoglobulin and is synthesized at a high 
rate (over 30 mg/kg/d, second only to IgA). The presence of high- 
affinity IgG is the hallmark of secondary humoral immune 
responses. Electrophoretically, IgG proteins migrate to the y range 
of serum globulins, hence IgG's earlier designation as gamma- 
globulin. Actually, IgG is composed of four subclasses of antibody, 
whose salient features are summarized in Table 3. The selection of 
IgG subclass by a particular immune response does not appear to 
be random: in murine systems, anti-carbohydrate specificities tend 
to be IgG3, anti-protein IgGl, and anti-viral IgG2a (148,149). In 
man, reactivities against polysaccharide immunogens are skewed 
toward IgGl and IgG2, while anti-protein and anti-viral y antibod- 
ies are biased in the direction of IgGl, IgG3, and IgG4 (150). As 



an offshoot of these phenomena, clinical syndromes in which spe- 
cific IgG subclasses are absent are known to present themselves as 
characteristic immunodeficiencies (see Chapter 43). 

Perhaps the most studied feature of the IgG isotypes is their abil- 
ity to activate the classical complement pathway. Although all four 
are capable of initiating the classical cascade, they do so to varying 
degrees (G3>G1>G2>G4) (150,151). Understanding the means by 
which the different IgG subclasses interact with specific compo- 
nents of complement has been difficult, complicated by many con- 
founding reports. Results indicating that Clq is unable to bind 
either IgG2 or IgG4 antibodies (152) were perplexing, given that 
both are able to activate the classical cascade. Similarly, despite 
FcY3's higher affinity for Clq (152), IgGl is more effective at 
potentiating complement-mediated cytolysis. When the site for 
Clq-binding was mapped to the C-terminal portion of the Cy2 
domain near the hinge region (153,154), investigators surmised 
that differences in the IgG subclasses* abilities to activate comple- 
ment were likely attributable to steric freedom, or lack thereof, 
conferred by the particular hinge of the antibody (155,156). For 
this reason, the longer hinge of the y3 C region was thought to 
account for IgG3*s higher affinity for Clq, relative to that of IgGl . 
Still, hinge-deletion (157) and hinge-swapping (158) experiments 
have yielded data that contradict the notion of the hinge being a key 
determinant for complement activation. Be that as it may, recall 
that proper glycosylation within the Cy2 domain is accepted as an 
obligatory element for fixation of complement as well. 

The explanation for the difference in efficacy of lysis by com- 
plement between IgGl and IgG3 (paradoxical, given their affinities 
for Clq) is even more convoluted. It may reflect that other differ- 
ences between the IgGl and IgG3 Clq sites are present that affect 
complement activation. Alternatively, it may derive from differ- 
ences in a second, separate site in the Cyl domains of these mole- 
cules which, like IgM, binds activated C3b and protects it from 
inhibition. This second site also likely explains the capacity of 
IgG2 and IgG4 to activate the classical pathway, despite an inabil- 
ity to bind Clq. Finally, note that IgG4 is able to efficiently recruit 
and activate the alternative complement cascade, distinguishing it 
from the other three subclasses. 

Another means by which IgG antibodies communicate with the 
effector arms of the immune system is via the Fey receptors 
(FcyRs). A number of different IgG FcR exist (covered specifically 
in the section on the IgSF), each of which have their own profile 
and affinities for binding of the different IgG subclasses, expres- 
sion patterns on different cell types, and different biologic respon- 
sibilities (159). Among the immunologic cell types implicated as 



TABLE 3. Properties of Human IgG subclasses 


Property 


IgGl 


igG2 


igG3 


igG4 


Disulfide linkages 


2 


4 


5-15 


2 


Molecular weight 


146 kD 


146 kD 


165 kD 


146 kD 


Percentage of total serum immunoglobulin 


34-87% 


5-56% 


0.5-12% 


7-12% 


Average adult free serum level (mg/ml) 


5.9 ± 2.6 


3 ±2.5 


0.6 ± 0.55 


0.9 ± 0.25 


Macrophage binding by FcyR 


+ 




+ 


+ 


Placental transfer 


+ 


++ 


+ 


++ 


ADCC 




+ 


+++ 


+ 


Classical complement activation 


+++ 


++ 


++++ 


+ 


Alternative complement activation 


+ 


+ 


+ 


+++ 



Adapted from Simard and Mak (140). 



Structure and Func *'n of Immunoglobuuns / 63 



important binders of IgG are macrophages, polymononuclear cells, 
and lymphocytes (including B cells). Interactions with these recep- 
tors cause many functional effects, including phagocytosis (160) 
and antibody-dependent cell-mediated cytotoxicity (161), both of 
which ultimately lead to the destruction of the bound antigen. 
Specifically, the hierarchy for ADCC by mononuclear cells is IgGl, 
IgG3 > IgG2, IgG4 (152,162). Signals transmitted via FcyR also 
modulate lymphocyte function by means of up-regulation or down- 
regulation of antigen presentation, cytokine release, cytokine recep- 
tor expression and/or sensitivity, and even immunoglobulin secre- 
tion. Even soluble FcyR are known to bind IgG, although the 
significance of this finding is unclear (163). Finally, IgG FcR also 
permit transplacental movement of maternal antibodies during ges- 
tation (164). This provides the developing fetus with a source of 
high-affinity serum immunoglobulin that is able to interact with 
complement to mediate biologic effects at a time at which it has no 
other form of specific humoral immunity. It should not be over- 
looked that IgG molecules are the most stable isotype in serum 
(with a half-life of over 3 weeks), further maximizing their utility 
in this endeavor — even into the post-natal period. 

Binding of the four IgG subclasses by the different FcyR varies 
in terms of the specific contact residues involved for each respec- 
tive iigand-receptor pair. Generally, although the IgG binding sites 
for the FcR are thought to largely overlap, the precise elements 
responsible for interaction likely have subtle differences. By con- 
sensus, research into these issues has suggested that the sites are 
bipartite, consisting of a site on the C-terminal portion of the hinge 
and also reliant upon residues found in the portion of the Cy2 
domain already implicated in Clq-binding (86,165). Because the 
four IgG isotypes differ considerably in these regions, this would 
fit nicely with their noted differential binding of the varied FcyR. 

IgA 

IgA is the major immunoglobulin in external secretions such as 
saliva, mucus, sweat, gastric fluid, and tears. Moreover, it is also 
the major immunoglobulin of colostrum and breast milk, where it 
provides the neonate with a readily available source of intestinal 
protection against pathogens ( 1 67). The secretory forms of IgA are 
exclusively polymeric, including J chain and SC in the manner 
described previously. In addition, IgA — present predominantly in 
its monomeric form — is also an important component of serum Ig, 
where it makes up 10% to 15% of the total. The synthetic rate of 
IgA is roughly double that of IgG, such that total daily IgA pro- 
duction outpaces that of all other immunoglobulins combined. The 
majority of IgA synthesized is in the secretory form, with the 
largest fraction of IgA plasma cells residing in the subepithelial 
mucosa of the small intestine. Because secretory IgA coats all 
external surfaces except skin, it is rightly considered a first line of 
defense against organisms that would invade via mucosal routes. 
IgA's role in mucosal immunity (see Chapter 27) is phenotypically 
evident in persons with the most common genetic defect of the 
humoral immune system, IgA deficiency (see Chapter 43). Indi- 
viduals with this condition are susceptible to invasion across 
mucosal barriers and typically present clinically with recurrent 
infections of this type. 

Serum and secreted IgA originate from separate pools of B lym- 
phocytes: plasma cells in specialized sites of the respiratory, uro- 
genital, gastrointestinal, and mammary tissues produce the IgA 
found in secretions, while the IgA in serum emanates from plasma 



cells in the bone marrow, lymph nodes, and spleen. Despite this 
compartmentalization of production, antigenic exposure occurring 
at either mucosal or systemic sites will prime the development of 
both secretory and serum IgA responses simultaneously (168). 

In humans, the two IgA subclasses, IgAl and IgA2, show an 
interesting division of expression which affects their resultant bio- 
logic utilities. IgAl exists primarily as a monomeric molecule, and 
accordingly is the main IgA isotype in plasma (refer to Fig. 20). In 
bone marrow, about 90% of IgA-secreting plasma cells make IgAl 
(169). IgA2, on the other hand, is usually found as a polymer. 
Recall that the main structural difference between these two iso- 
types is localized to the hinge. Whereas the IgAl subclass has a 
higher concentration of carbohydrate in its hinge region (protecting 
it from most forms of proteolytic degradation), the IgA2 isotype 
has deleted much of that hinge region — presumably as ah evolu- 
tionary response to bacterial IgAl -specific proteases (170). Thus, 
it is consistent that the broadly protease-resistant form (IgAl) 
should predominate in serum to maximize its lifespan, while the 
targeted protease-resistant subclass (IgA2) should prevail where 
bacterial exposure is more common. 

IgA does not efficiently induce inflammatory responses. Rather, 
it is believed to protect primarily by exclusion, binding and cross- 
linking pathogens to prevent their uptake across epithelia and facil- 
itating their expulsion in mucus excretions (123a, 140). It is note- 
worthy that inflammatory responses localized to mucosa would 
likely be detrimental to barrier function, as tissue damage could 
compromise the integrity of epithelial surfaces. While IgA does 
have the ability to fix complement via the alternate cascade, this 
ability is restricted to IgAl. IgA can also opsonize antigens for 
phagocytosis; this is accomplished via a specific Fca receptor 
(FcotR) found on macrophages, monocytes, and neutrophils. This 
provides a mechanism for IgA immune complexes that accumulate 
at mucosal surfaces to be engulfed and processed. The FcccR is 
known to bind secretory IgA with higher affinity than serum IgA, 
but strangely, the site on IgA that is recognized seems to be unre- 
lated to the J chain or SC which distinguish the secretory and 
serum forms (66). Rather, in a manner unique from all other IgSF 
FcR, which see a hinge-proximal site in the Ch2 (or equivalent) 
domain, the FcctR sees a site bridging the domain boundary 
between C a 2 and C a 3, reminiscent of SPA binding (66). Finally, 
IgA has also been shown to induce eosinophil degranulation via the 
FcaR, implicating it in antiparasite immunity. Given that many 
parasites gain access to host tissues by crossing mucosal barriers, 
this is a logical biologic activity for IgA as well. 

IgE 

IgE is present in serum in the lowest concentration of all the 
immunoglobulins. Its rate of synthesis is between 25- and 2,000- 
fold less than each of the other isotypes, it has the shortest serum 
half-life, is unable to activate either the classical or alternative 
complement cascades, and lacks the ability to opsonize antigens. 
Nonetheless, IgE's biological effects more than compensate for 
these shortcomings, due to the profound efficiency of its behavior. 
The principle function of IgE is to arm basophils and mast cells 
with specific antigen receptors. These cells in turn act as potent 
dispensers of inflammatory reactions (see Chapter 32). 

Plasma cells that produce IgE are chiefly found in the lung and 
skin. Upon its release from these B cells, circulating IgE is quickly 
bound by a high-affinity Fee receptor (FceRl; K D = 10 ID M) found 
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on these granulocytes, allowing the IgE molecules to stably remain 
on the cells for weeks or months. Once primed with many such 
antigen-specific receptors (recognize that cells bearing FceRJ can 
have IgE molecules of many different reactivities on their surfaces, 
unlike the case for antigen-specific B cells), multivalent antigen 
can then cross-link the bound IgE, indirectly cross-linking the 
FceRI molecules as well. Ultimately, this causes mast cells and 
basophils to release granules containing inflammation-mediating 
substances and chemoattractants for a variety of cell types. The 
granule contents of mast cells and basophils are powerful, able to 
induce rapid responses — including mucous secretion, coughing 
and sneezing, vomiting, diarrhea, and inflammation. While this 
type of response can be vital in the clearance of parasites (see 
Chapter 38), it has the unfortunate consequences of also causing 
allergy (see Chapter 35) and anaphylaxis in predisposed individu- 
als. In such atopic individuals, it has been seen that increased 
amounts of IgE are synthesized and found on the surfaces of mast 
cells and basophils, likely explaining their predilection for these 
inappropriate responses. 

Other cells types also express the high-affinity FceRI, including 
Langerhan's cells (171,172) and eosinophils (173), but the ratio- 
nale for its presence there is yet to be definitively elucidated. In 
addition, the CD23 surface antigen has also been shown to be a 
low-affinity IgE receptor (designated FceRII). Among other cell 
types, CD23 is known to be expressed on monocytes and some fol- 
licular B cells. In fact, monocytes can even be induced to secrete a 
soluble form of FceRII (174), but once again the significance of 
this finding is unclear. Considering the low levels of circulating 
IgE, the relatively low affinity of the receptor, and the fact that 
CD23 is known to interact with CD11/CD18, there is doubt as to 
whether IgE is even an important ligand for this receptor in vivo. 

Like the plgR and several of the FcyRs, the FceRI molecule is also 
a member of the IgSF (detailed further in the following section). 
Interaction between IgE and its high-affinity receptor was the first 
well-characterized Ig-FcR ligand-receptor pair of this type. Origi- 
nally, studies using synthetic peptides as specific inhibitors of 
IgE-FceRI binding identified a 76-amino acid polypeptide spanning 
Ce2-C e 3 as the FcR recognition site on IgE (175). Subsequently, this 
localization was refined further to a site in Ce3 analogous to the FcyR 
site on Cy2. Unlike the binding situation for IgG that was also depen- 
dent upon residues in the hinge regions (see above discussion), the 
extra Ce2 hinge domain of IgE is not believed to play a significant 
role in the interaction (176). 

THE IMMUNOGLOBULIN SUPERFAMILY 

Evolution of the Immunoglobulin Superfamily 

Soon after the sequencing and structural analyses of antibodies 
revealed the protein motif of the immunoglobulin domain (7,177), 
it became apparent that evolution had incorporated Ig homology 
domains in a variety of other important molecules as well. The 
sequencing of MHC genes, TCRs, and the plgR, among others, 
demonstrated the use of both V region- and C region-type domains 
by a number of cell-surface proteins of the immune system. Con- 
temporaneously, a number of cell adhesion molecules (CAMs) 
involved with neurite outgrowth in developing axons were also 
found to contain Ig-like domains (reviewed in ref. 178). It was 
quickly recognized that a large family of genes that contained puta- 
tive immunoglobulin folds existed (2,3,179), whose members were 
globally implicated in issues of molecular recognition and/or cel- 



lular adhesion. Comprised of several multigene families in their 
own right (V H , V L , TCRa, TCR0, TCRy, TCRS, MHC I, MHC II, 
Sialoadhesin, CAM, etc.), the term immunoglobulin superfamily 
was adopted to refer to this diverse group of genes, which each 
contained one or more Ig homology domains. 

Currently the IgSF encompasses well over 100 genes, and 
extends across several phylogenetic boundaries (reviewed compre- 
hensively in ref. 4). Disparate species in which IgSF members have 
been identified include chicken, zebrafish, tunicates, grasshoppers, 
squid, C. elegans. sponges, and S. cerevisiae. In addition, reports 
identifying proteins containing structures reminiscent of immuno- 
globulin homology domains from prokaryotic organisms (1 80, 181) 
raise the possibility that this archetypal structure antedates even 
eukaryote evolution. The discovery of new molecules with novel 
functional attributes (for this class of proteins) also continues to 
expand the role of IgSF members. For instance, while the prepon- 
derance of immunoglobulin homology domain-containing proteins 
that have been identified are either cell surface or secreted proteins 
involved in recognition and adhesion events, a newer class of intra- 
cellular muscle proteins (titin, telokin, etc.) belonging to the IgSF 
demonstrate that the immunoglobulin fold structure is adaptable to 
an assortment of functional capacities. 

The evolution of the IgSF has been the subject of considerable 
scientific speculation and potentially has implications for both the 
development of the vertebrate immune system and the process of 
organogenesis in general. Based upon the overwhelming number of 
IgSF members that possess adhesive qualities, it has been proposed 
that the first IgSF molecules were simply single Ig domain extra- 
cellular proteins that served as primordial "cellular glues" (2,3). 
Substantiating this argument is the noted stability of the compact 
p-barrel structure of the immunoglobulin fold, which would foster 
its utility in harsh extracellular environments. Further bolstering this 
hypothesis is the fact that numerous IgSF proteins participate in 
both homotypic and heterotypic interactions with other IgSF mole- 
cules, demonstrating their potential to act as adhesion molecules. 

Some evidence suggests that IgSF proteins have promoted 
clustering of cells since the earliest stages of eukaryotic develop- 
ment. For example, the yeast S. cerevisiae uses the IgSF glyco- 
protein ct-agglutinin to mediate cell-cell contact during mating 
(182). IgSF forebears may have also allowed the first examples of 
rudimentary organogenesis in phylogeny. The slime mold Dic- 
tyostelium, which bridges the gap between unicellular and multicel- 
lular eukaryotes, uses a protein possessing a region with striking 
similarity to an immunoglobulin domain for the purpose of forming 
aggregations called "fruiting bodies" when conditions are nutrient- 
scarce (183,184). 

Finally, it has also been put forth that ancestral IgSF glycopro- 
teins may have mediated the first evolutionary examples of 
allorecognition in colonial invertebrates (185). In defense of this 
proposition, it is noteworthy that two examples of metazoan recep- 
tor tyrosine kinases with purported recognition functions have been 
shown to possess extracellular Ig-like domains, one from the 
cnidarian Hydra vulgaris and the other from the marine sponge 
Geodia cydonium. From these data, then, it is possible to make ten- 
tative, yet tenable, conjecture that the complex cellular and molec- 
ular interactions of the vertebrate immune system (mediated in no 
small part by members of the IgSF) may in fact be an outgrowth of 
this primitive allorecognition. In this light, the notable analogy 
between vertebrate graft acceptance/rejection reactions and colo- 
nial invertebrate fusion/rejection phenomena perhaps takes on new 
significance (see Chapter 18). 
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Regardless of its derivation, the immunoglobulin domain has 
obviously proven to be a pliable evolutionary substrate, amenable 
to mutation and diversification for a number of important reasons. 
First, as was noted for the actual domains of immunoglobulins, the 
primary structure of these units can vary dramatically without 
appreciably altering their tertiary structure (186,187). This is par- 
ticularly evident in the interconnecting loops that join the p strands, 
allowing them to diverge rapidly to perform a multitude of distinct 
functions. Second, most Ig domains are encoded by discrete exons, 
facilitating their duplication by relatively simple genetic events. 
This one-domain-per-exon rule is also conducive to alternative 
splicing phenomena, encouraging differential expression of IgSF 
molecules as well. This is further accommodated by a splicing con- 
vention followed by most IgSF exons: The 3' end of one exon is 
always the first position of a codon, while the 5' end of the next 
tandem unit begins with the second position of a codon. Thus, 
immunoglobulin homology domains of IgSF proteins may be eas- 
ily duplicated in tandem (the C. elegans muscle protein rwitchin 
contains 26 Ig-like domains) and shuffled to create new genes with 
the capacity to diversify both somatically and evolutionarily 
Finally, the propensity of Ig domains to form homotypic and/or 
heterotypic dimers (also demonstrated by immunoglobulin proper) 
forms the basis for proteins which serve as receptor and ligand 
molecules. These combinatorial associations enhance their diversi- 
fication potentials still further. 

Despite the inherent complexity in a gene superfamily contain- 
ing such vast disparities in its members' functional qualities, it was 
recognized early on that IgSF proteins could be subdivided into 
distinct "sets" on the basis of sequence and structural analyses (2). 
These groupings are based upon the arrangement of the p strands 
of the immunoglobulin fold and are schematized in Fig. 2 1 . Note 



that while V-set domains are composed of a four-strand sheet and a 
five-strand sheet (as detailed earlier), C-set domains have four- 
strand and three-strand layers; these are discriminated on the basis 
of placement of the D strand in the sheet of strands A, B, and E (the 
CI set) or with the layer formed by strands G, F, and C (the C2 set). 
However, studies with the IgSF muscle protein telokin have 
revealed a new "I set" which has domain features that are inter- 
mediary between the V and CI sets (188). Moreover, many IgSF 
adhesion molecules and cell-surface receptors likely belong to this 
I set, rather than to the sets to which they were previously ascribed. 
In any case, the IgSF remains a fascinating collection of proteins 
with structural similarities but a wide array of functional abilities. 
While immunoglobulin remains the definitive example of this class 
of molecules, a number of IgSF proteins are also of particular sig- 
nificance to humoral immune responses, and their structural and 
functional characteristics are briefly summarized in the following 
sections. 

Fc Receptor Molecules 

FcR allow antibodies to interact with cells of both the specific 
and non-specific immune systems. In so doing, FcR connect 
humoral immune responses to cellular immune responses, and 
more globally, acquired immunity to that of innate immunity. These 
contacts play two vital roles in the biology of immune functioning. 
First, FcR allow antibodies to act as "flags" signaling the need for 
certain cellular effector events, such as phagocytosis and ADCC. 
Second, the different FcR facilitate antibody acting as a mediator 
of overall immune regulation. The signals they transmit can induce 
changes in cytokine secretion, expression of cell-surface receptors, 
and extensive differentiation programs (189). 
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FIG. 21. Topology of different immunoglobulin domain types. Diagrams of the (A) C1 set, (B) V set, and (C) C2 set 
are presented. In the upper part of the figure, p strands are depicted as broad arrows and their intervening loops by 
thin lines. Note that the V-type domains have five- and four-stranded faces, while C1- and C2-type domains have 
four- and three-strand faces. The C region-like structures are discriminated on the basis of placement of their D 
strand. In the lower part of the figure, an end-on view of the different p-barrels is shown. Triangles {with their apex at 
the top) symbolize p strands running out of the plane of the paper; triangles {whose apex points down) are p strands 
traveling into the paper. Bold lines represent connecting loops at the top of the immunoglobulin fold; thin lines indi- 
cate connections at the bottom of the domain. (From ref. 4, with permission.) 
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Three large classes of molecules can bind Fc regions: glycosyl- 
transferases, which recognize oligosaccharide derivatives on anti- 
bodies, lectin-like molecules, and receptors belonging to the lgSF. 
Of the "true" FcR that recognize antibody protein determinants 
rather than carbohydrate, all FcR thus far identified belong to the 
IgSF, other than the low-affinity IgE receptor (CD23/FceRII). 
These molecules include FcyR I, II, and III (CD64, CD32, and 
CD 1 6), FceRI, FcaR (CD89), and the plgR, which has already 
been discussed. All cells of lymphoid origin express FcRs, 
although the profiles and isotype specificities between lineages can 
vary greatly (reviewed in refs. 190 and 191). While receptors for all 
classes of immunoglobulin have been described as biological activ- 
ities, human FcuR and Fc8R have not yet been cloned. Thus far, the 
FcyR proteins and FceRI are the most well characterized examples 
of these molecules. 

FcyR Molecules 

Receptors for the Fc portion of IgG are of three types (reviewed 
in ref. 192). FC7RI (CD64) is a high-affinity receptor and the only 
one able to bind monomelic IgG. It possesses three extracellular 
Ig-like domains. FC7RII (CD32) and FcyRIII (CD16) are both low- 
affinity receptors that bind IgG-containing immune complexes. 
They each have only two extracellular Ig homology domains. 
Schematic diagrams of the FcyRI,' FcyRII, FC7RIII complexes, 
along with the FceRI and FcaR complexes, are presented in Fig. 
22. 

FcyR! is a 70-kD glycoprotein that is constitutively expressed at 
low levels on monocytes and macrophages. IFN-y upregulates its 
levels on these cells, and also can induce its expression by neu- 
trophils. FcyRTs affinity for IgG is highest for the IgGl and IgG3 
subclasses (K D = 10" 8 M), tenfold lower for IgG4, and will not bind 
IgG2. Functionally, the primary effect of cross-linking FcyRI mol- 
ecules appears to be the potentiation of both ADCC and phagocy- 
tosis. As IFN-y enhances both of these activities by the cell types 
known to express FcyRI, this would fit well with their being 
important roles for the receptor. 

Like surface immunoglobulin, FcyRI requires accessory proteins 
in order to transmit signals. This is, in fact, a common feature of 
most FcRs (except for FcyRII) and an interesting parallel between 
the IgSF antigen receptors (BCR andTCR) and the IgSF "indirect" 



antigen receptors (the FcR), which use antibody to bridge the span 
between FcR and antigen (193). In the specific case of FcyRI, the 
actual signaling molecule is a 12y-kD transmembrane protein des- 
ignated the '*y-subunit" or, more generally, FcRy This nomencla- 
ture can be particularly confusing, as the "Y* of FcRy refers not to 
the fact that it is part of the of FcyR complex (the receptor for y- 
class immunoglobulin), but rather to y as an individual subunit of a 
multi-molecule complex. In any case, for the FcyRI complex, the a 
subunit is the actual IgSF protein FcyRI, and the y subunit is FcRy, 
which forms a disul fide-linked homodimer. Complicating termi- 
nology further, FcRy is also a subunit of other FcR complexes, 
including that of the FC7RIIIA and those of the non-FcyR, FceRI 
and FcaR. Intriguingly, FcRy is a close homologue of the TCR- 
associated protein CD3£. In fact, CD3£ cannot only heterodimer- 
ize with FcRy, but also has been shown to be capable of function- 
ally substituting for FcRy as the signal-transducing subunit of the 
FcyRIIIA receptor complex (194). 

The situation for FcyRII (CD32) is even more complex. FcyRI I is 
the product of three distinct but homologous genes: FcyRIIA, 
FcyRIIB, and FcyRIIC. This is further complicated by the fact that at 
least two of the FcyRII genes are alternatively spliced to generate 
multiple isoforms (195). The FcyRIIA gene gives rise to two tran- 
scripts: FcyRHal, which has a transmembrane domain, and 
FcyRIIa2, which lacks it The FcyRIIB gene has three isoforms — 
FcyRIIbl, FcyRHb2, and FcyRIIb3 — generated by differential splic- 
ing and alternative polyadenylation processing. Collectively, the 
FcyRII variants^are the most ubiquitously expressed FcyRs, being 
present on monocytes, macrophages, neutrophils, B lymphocytes, 
megakaryocytes, and platelets. Specifically, megakaryocytes express 
FcyRIIA (both isoforms), B lymphocytes express FcyRIIB (bl and 
b2 transcripts) and FcyRIIC, and cells of myelomonocyte derivation 
produce at least one or more isoforms from all three genes (195). 

Functionally, due to their expression on many cell types, FcyRII 
signals cause diverse effects. When cell surface FcyRII engage IgG 
immune complexes (all subclasses, with varying affinities), they 
potentiate several biologic changes, most immunoregulatory in 
nature. Generally, these signals down modulate IgG-, IgA-, and IgE- 
mediated activations of a number of cell types, including monocytes 
and macrophages, granulocytes, mast cells, and Langerhans and 
other dendritic cells. They also induce platelet aggregation at the site 
of immune complexes and effect B cell feedback inhibition by down- 
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FIG. 22. Schematic diagram of human immunoglobulin Fc receptors belonging to the IgSF. Each Ig domain is 
depicted as a rounded bulge. The a chains are the components of the receptor complex that determine binding 
specificity. (3 and y chains are responsible for association and signal propagation by the receptor(s). 
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regulating both proliferation and antibody production. Specifically, 
FcyRIIB are known to suppress BCR-mediated activation signals 
when the two are coaggregated (196). The signal for this inhibition 
is brought about in a manner unique among all IgSF FcR. As single- 
chain receptors without accessory proteins, FcyRII are able to trans- 
duce their own signals. They do so by way of an immunoreceptor 
tyrosine-based inhibition motif (ITIM) present in the cytoplasmic 
region of the protein (197). 

Specific binding of IgG by FC7RII was initially mapped to the C- 
terminal portion of the second extracellular Ig-like domain of the 
receptor (1 98). This primary site, comprising residues Asn 1 54-Ser 
161, has since been revised to also include domain 2 stretches Ser 
109-Val 1 16 and Phe 129-Thr 135, along with domain I contacts 
(199). In sum, a three-dimensional model of the entire FcyRII 
extracellular region predicts that loops of both Ig-like domains that 
co-localize to the domain interface are responsible for the recogni- 
tion of IgG. 

The final IgG FcR is the FC7RIII (CD 16). FcyRJII has two extra- 
cellular Ig-like domains and is encoded by two separate genes 
whose expression varies by cell type. On monocytes, macrophages, 
and NK cells, it is a transmembrane glycoprotein called FcyRIIIA. 
The FcyRIIIA receptor protein has three accessory proteins with 
which it is complexed. The first is a 30-kD "(3-subunit" having four 
transmembrane regions, which is also a component of the FceRI 
complex. The other protein(s) associated with FcyRIIIA is a 
homodimer of FcRy subunits, which, as explained earlier, is also 
part of the receptor complexes of FcyRI, FceRI, and FcaR (200). 
The other FcyRJII gene encodes a glycophosphoinositol-Hnked 
protein termed FcyRIIIB that is expressed on neutrophils. Individ- 
uals deficient in this gene suffer from a condition called paroxys- 
mal nocturnal hemoglobinuria, characterized by increased suscep- 
tibility to infection and delayed clearance of immune complexes 
(201). FcyRIIFs binding preference is for IgG! and IgG3, both of 
which it binds with low affinity only in the form of immune com- 
plexes. Biologic activities fostered by FcyRM include ADCC, 
phagocytosis, and transport of internalized Ab-Ag complexes to 
the antigen-presentation pathway. 

FceRI and FcaR Molecules ) 

The high-affinity IgE receptor (FceRI) was the first and best 
characterized of the FcR (202). FceRI is a transmembrane protein 
having two extracellular Ig homology domains and, like previous 
examples, associates with accessory proteins for signal-transmis- 
sion purposes. It is expressed on mast cells, basophils, eosinophils, 
Langerhans cells, and on the monocytes of atopic individuals 
(203). The proteins associated with FceRI are the same as for 
FcyRIIIA: one p subunit and a homodimer of FcRy proteins. Gene 
targeting experiments to verify the roles these proteins play in the 
FceRI complex have yielded reassuring results: homozygous dele- 
tion of the IgSF chain of the receptor created animals that were pre- 
dictably resistant to anaphylaxis (204); disruption of FcRy caused 
the same phenotype, plus defects in ADCC and phagocytosis con- 
sistent with the y-subunit's participation in other receptor com- 
plexes (205,206). 

The Fee-binding site on FceRI has been localized to three 
regions of the second extracellular Ig domain (198). It is important 
to remember that serum IgE binds to the receptor in a high-affinity 
interaction not dependent on antigen (unlike all other antibody iso- 
types, which must first bind antigen in order to be recognized by 



their respective FcR). This allows polyvalent immunogens to bind 
effector cells directly, without the need for conformational change 
of the immunoglobulin and/or immune complex formation. This 
contributes to the rapidity of the response exhibited by cells 
expressing FceRI. The specific biologic effects of cross-linking the 
receptor were discussed previously in the section on IgE function. 

The final FcR of the IgSF to be covered is that for IgA. The 
FcaR (CD89) is the most recently identified and least character- 
ized of the different Ig receptor classes. It possesses two extra- 
cellular Ig domains and is expressed by monocytes, macrophages, 
neutrophils, and eosinophils. Several isoforms have been identi- 
fied: a cell surface FcotRa form, which has intracellular and 
transmembrane domains, an FcaRb form lacking these domains 
that is both secreted and associated with the cell surface (by an 
unknown mechanism), and even an isoform lacking the mem- 
brane-proximal Ig-like domain. Structurally, the FcaR has 
homology with the FcyR molecules. Like FcyRI, FcyRJII, and 
FceRI, the IgA receptor complex includes the FcRy homodimer 
as a signaling component. The particulars of the receptor binding 
site on the FcaR are not yet determined. The site it recognizes on 
IgA and the effects mediated by FcaR-binding were detailed in 
the section on IgA function. 

Coreceptor CD4 and CD8 Molecules 

Antigen-recognition functions in the body are not limited to 
immunoglobulin but are also performed by receptors on T cells 
(TCR), which bind antigenic peptides. In a defining event of 
immune responses, these antigenic fragments are "presented" to 
T cells within the context of molecules of the MHC. Recognition 
of MHC/antigen by TCR is the fundamental biologic interaction 
responsible for initiation, perpetuation, and mediation of cellular 
immunity. As pertains to antibody, binding of MHC/peptide com- 
plexes by TCR is also vital for recruitment of T cell help needed 
in many humoral immune responses. While the details and effects 
of these vital interactions are well beyond the scope of this chap- 
ter (see Chapters 8-13), many IgSF proteins play key roles in 
assuring its productive functional outcome. Of these, the TCR 
"coreceptors" CD4 and CD8 are crucial components worthy of 
mention here. 

CD4 and CD 8 were among the first non- immunoglobulin IgSF 
members for which structural information became available 
(reviewed in ref. 207). These molecules are each expressed on the 
surface of T cells, where they participate in TCR/MHC interactions 
(schematized in Fig. 23) by engaging non-polymorphic regions of 
the MHC in low-affinity interactions (208,209). T cells break down 
into two major functional subclasses — helper T cells and cytotoxic 
T cells — characterized by different responses to antigen. Both T 
cell types utilize the same group of TCR genetic elements to com- 
pose their specific antigen-receptors, however. Ordinarily, although 
there are notable exceptions, T cell effector functions correlate 
with the type of MHC protein with which they interact. MHC I 
molecules specify cytotoxic T cell responses and are found on most 
cell types of the body. MHC II molecules, on the other hand, dic- 
tate helper T cell functioning and are more restricted in their 
expression, typically found only on "professional" antigen-present- 
ing cells. CD8 molecules bind to class I MHC proteins, while CD4 
molecules mediate interaction with class II; thus, these two pro- 
teins play an important role in determining what type of response 
a particular T cell is likely to mediate. 
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FIG. 23. Schematic diagram of the CD8 and CD4 coreceptor mole- 
cules. The figure shows both TCR/MHC coreceptor complexes on 
the same membrane, although T cells express either CD8 or CD4 for 
the majority of their lifetime. Bulges represent Ig domains, and gray 
ovals signify peptide presented by MHC molecules. Only the CD8ct/cc 
homodimer is schematized here, although the CD8ct/p heterodimer 
presumably binds MHC I in similar fashion. The models demonstrate 
the simplest stoichiometry for association; other possible modes of 
interaction are discussed in the text. 



Both CD8 and CD4 are glycoproteins and, in their most common 
incarnations, possess transmembrane segments and short cytoplas- 
mic tails (2 1 0). The intracytoplasmic regions of both molecules 
interact with the src-Uke tyrosine kinase p56 ,ck (211), which pre- 
sumably serves to allow signal-transduction necessary for proper 
thymic selection (212) and activation (213) of T cells. Reinforcing 
this idea is the fact that co-ligation of CD4 or CD8 with TCR 
greatly enhances stimulation of T cells relative to that of cross- link- 
ing TCR alone (see Chapters 1 2 and 1 3). In addition, CD8 and CD4 
also increase the avidity of interaction between TCR and MHC, by 
virtue of their action as adhesion molecules between the two cell 
membranes. Collectively, these behaviors have been estimated to 
boost antigen recognition over 100-fold from that of basal levels 
(TCR engagement by MHC/Ag only). However, despite these sim- 
ilarities in their biologic effects, structurally CD 8 and CD4 have 
many important differences. While many details are still unre- 
solved, a number of crystals involving these two proteins have been 
solved by x-ray diffraction, permitting a thorough examination of 
their salient characteristics. 

CD8 

CD8 exists as a disulfide-linked dimer in one of two forms. A 
homodimer of two CD8a subunits was the first human isoform iden- 
tified (214). Subsequently, a CD8oe/p heterodimer was described as 
well (215,216). Both proteins are 34-kD and have homologous 
(although only 17% identical) N-terrninal Ig-like domains, extended 
hinge regions of 50 (a) and 30 (p) amino acid residues, single trans- 
membrane stretches, and short cytoplasmic tails. While the CD8p 
chain lacks residues necessary for interaction with p56 lck , the het- 
erodimer is still capable of interacting with it via the a subunit. Nev- 
ertheless, several lines of evidence indicate that a specific role for the 
heterodimer may exist, apart from that of the homodimer. Differ- 
ences in functional effects (217), thymic selection (218), avidity for 
MHC (219), and p56 lck activity (220) have all been attributed to the 



CD8p chain. In addition, there are differences in expression of the 
different iso forms: thymocytes and peripheral T cells are CD8ot/p+, 
TCR y/5+ intraepithelial lymphocytes (IEL) are primarily CD8a/ot 
and TCR cc/p+ IEL express either of the two molecules. Finally, 
investigations into CD8p have divulged that there are actually two 
genes encoding this protein (221) (i.e., the locus has been recently 
duplicated), which — along with alternative splicing phenomena — 
results in as many as seven unique CD8P iso forms being expressed 
(222). Four of these transcripts lack the transmembrane -region of the 
message, raising the possibility that some forms of CD8p may be 
secreted. In sum, the story of this protein subunit, and of the CD8 
heterodimer that derives from it, is still an active area of research that 
is yet to be clarified. 

The enumeration of the CD8ot/a homodimer, thanks in large 
part to two crystal structures, is somewhat further along. The pri- 
mary advance in this regard was the solving of the amino-terminal 
domains of human CD8ot/cc (223). This study revealed that these 
1 13-amino acid segments formed V-type Ig domains consisting of 
four- and five-strand layers (see Fig. 24). In agreement with its V 
region-type topology, CD8a domains were shown to dimerize with 
one another via their five-strand faces, as do immunoglobulin V 
domains (see Colorplate 7). Two significant structural disparities 
between CD8cc and immunoglobulin V regions were also recog- 
nized. First, the C'-C" loop (corresponding to CDR2) is extended 
in the a subunit (note the right side of Fig. 24D). Second, while the 
usual intradomain disulfide bridge between p strands B and F was 
identified, an unpaired cysteine in strand C is also conserved. In 
rodent CD8a this residue has been shown to form the intradomain 
cystine together with the Cys of strand B (224). The a subunit 
hinge region is extensively glycosylated (via O-Iinkages), and this 
is thought to promote its adopting an extended structure 
(223,225-227). This is particularly important because an elongated 
conformation of the hinge would be necessary to allow the N-ter- 
minal Ig domain to interact with the MHC I molecule appropriately 
(refer back to Fig. 23). 

The other important structural features of CD8 concern its inter- 
action with class I MHC molecules. Initial experiments into these 
questions indicated that the CDR-like loops of CD8ct were involved 
with recognizing a negatively-charged region on the a3 domain of 
MHC I (228). The aforementioned crystal structure supported this 
conclusion by demonstrating that these same loops were the only 
region on CD8 where positive charge was localized. Mutational stud- 
ies performed after the crystal was solved also implicated residues in 
the A and B strands of the CD8a protein as contact points with the 
o2 domain of MHC I (229). As each CD8 homodimer has two a 
chains, and as the A and B strands of each chain are not on the dimer- 
izing face of the subunit, this implied that CD8*s interaction with 
MHC I could be bivalent (i.e., one CD8cc/a and two MHC class I 
proteins). The publication of a crystal structure of the complex con- 
taining CD8a/a and MHC I plus peptide has seemingly resolved 
these issues (230). The homodimer was shown to have contacts with 
not only the a3 domain, but also the a2 domain, and even with the 
p2-microglobulin subunit of MHC class I. Strikingly, the negatively- 
charged region of the cx3 MHC I protein fits between the CDR-like 
loops of the two CD8ct subunits in the fashion of classical anti- 
body-antigen interactions! However, because both CD8 subunits are 
needed for the binding of one such loop, it would appear that the sto- 
ichiometry of the CD8/MHC I interaction is in fact 1:1. Because the 
clustering of receptor complexes is likely an important feature for the 
generation of intracellular signals, this is a crucial piece of informa- 
tion, as shall be seen for CD4. 
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FIG. 24. Stereoviews of the a-carbon backbones of CD8cc, CD4 
domain 1, and the V L of the antibody REI. In (a,b) the Ig domain of 
CD8ct {solid lines) is superimposed on domain 1 of CD4 (dashed 
lines). In (c f d) V L (now in dashed lines) is overlaid by CD8a. Parts (a) 
and (c) are side views (parallel to the dimerization surface); (b) and 
(d) are perpendicular to the 0-sheet faces. In all cases, the CDR 
loops are at the top of the figure. Comparing parts (a) to (c) illus- 
trates the truncation of the F-G (top left) and C-C' (bottom left) 
loops of CD4 D1 relative to CD8a and V L . Comparison of the N-ter- 
mini [left edge of (b) and (d)] shows the shortening of CD4 DVs A 
strand as well. The CDR2-like C'-C*" loop [upper right of (b) and (d)] 
demonstrates that this segment is elongated in both CD8ct and CD4 
D1 relative to Vl. (From ref. 207, with permission.) 



CD4 

CD4 has four extracellular Ig homology domains (D1-D4) and 
is thought to exist as a 55-kD monomer on the celt surface (231). 
Scientifically, CD4 became the center of intense scrutiny when it 
was demonstrated to be the molecule utilized by HIV for attach- 
ment to T cells. Like immunoglobulin, proteolytic analyses estab- 
lished that CD4 generated stable fragments upon cleavage. These, 
in turn, were the initial substrates for crystallographic study. The 
amino-terminal (and T cell membrane-distal) D 1 D2 segments were 
the first regions of CD4 structurally determined, as they had been 
shown to contain the HIV-binding site. These studies (232,233) 
described an N-terminal V-type Ig domain (Dl) and a smaller, 
unusual Ig-like domain (D2), each with features unique among pre- 
viously reported IgSF structures. 

Dl is a four- and five-strand domain (see Colorplate 7) that 
maintains the normal core intradomain disulfide bond and its asso- 
ciated residues. However, by comparison with immunoglobulin V 
regions, it became apparent that part of the A strand was missing 
(see Fig. 24). Similarly, the two loops connecting p strands C to C 
and F to G were both shortened in length. Since amino acids in 
these positions of immunoglobulin (and CD8) participate in dimer- 
ization events, this was taken to be reflective of the fact that CD4 
was not known to dimerize. Like CD8, the CDR2-homoIogous C- 
C" loop of CD4 Dl is also extended relative to immunoglobulin 
(compare the right edges of Figs. 24B and 24D). In fact, a Phe 
residue found on this lengthened segment has been shown to be 
crucial for binding of HIV gpl20 to CD4. Another interesting char- 
acteristic of these crystals concerns the Dl to D2 domain connec- 
tion. Note in Colorplate 7 that the G strand of Dl is contigous with 
D2*s A strand (contrast with the elbow peptides connecting V and 
C domains). As a result, D 1 and D2 have a large amount of longi- 
tudinal contact and, accordingly, little flexibility to move relative to 
one another. 

The D2 domain is even more peculiar. D2 is smaller than most 
Ig domains, and it consists of only seven P strands, like Ig C-set 
domains (see Colorplate 7). Unlike the canonical C-type domain, 
D2 has switched the placement of one of its strands from one face 
to the other (i.e., it belongs to the C2 set; refer back to Fig. 21). 
Remarkably, D2 also fails to conserve the core residues necessary 
for the typical intradomain cystine. Curiously, its intradomain 
disulfide linkage is between cysteines found in the same p- 
stranded sheet (i.e., its disulfide bond is mfrasheet instead of the 
usual intersheet). Despite this idiosyncratic arrangement, D2 forms 
an Ig fold consistent with other Ig domains — a powerful testimony 
to the principle of tertiary structural conservation in the face of pri- 
mary structural variation, embodied by the IgSF 

The remaining two Ig domains of CD4 — D3 and D4 — have also 
been crystallized (albeit the rat CD4 homologue) and their struc- 
tures elucidated by x-ray diffraction (234,235). Remarkably, the 
D3D4 fragment adopts a conformation resembling that of the 
D1D2 portion of the molecule (see Colorplate 7), embracing prior 
hypotheses that CD4 arose by way of duplication of a two-domain 
precursor (236). D3 is a larger V-type domain homologous to Dl, 
and D4 is a smaller module reminiscent of D2. Once again, the D3 
G strand is contiguous with the A strand of D4, closely approxi- 
mating the two domains and limiting the flexibility between them. 
However, D3 does have characteristics which distinguish it from 
the Dl domain. First, D3 fails to conserve the intradomain disul- 
fide bond, resulting in a "relaxed" domain, which is packed less- 
tightly. Second, unlike Dl, the C to C and F to G loops of D3 are 
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not shortened relative to immunoglobulin V regions. Still, these 
loops are unlikely to mediate dimerization of D3 domains (as they 
did in the Fv fragment of immunoglobulin), on account of an re- 
linked glycosylation site on the F strand of D3*s inner face that 
would interfere with such interactions. Recently, a recombinant 
soluble form of human CD4 has also been solved crystallographi- 
cally (237). While consistent with previous conclusions, this report 
makes the added contribution of definitively establishing the D1D2 
to D3D4 junction as a hinge-like region of the protein. Thus, the 
rod-like two domain portions of CD4 (D1D2 and D3D4) are able 
to bend at a point of flexion akin to the scenario for Fab-Fc bend- 
ing at the antibody hinge. 

Studies have also examined the means by which CD4 binds to 
MHC II proteins (refer back to Fig. 23). CD4 contacts the a2 and 
p2 domains of class II molecules using a variety of residues in the 
D 1 and D2 domains. Most evidence points to a large surface of 
CD4, involving both lateral faces of Dl and the F^G loop of the 
D2 domain, being implicated in MHC II binding (238-240). 
Once more the question of the stoichiometry of interaction 
between coreceptor and MHC is of interest. As both sides of the 
D 1 domain appear to contact the class II molecule, the prospect 
of a bivalent complex (one CD4 protein with two MHC II mole- 
cules) is once again at issue. Given that crystal studies of class II 
proteins have demonstrated a dimeric association between MHC 
II molecules, this seems a plausible mode of complex formation. 
Contrarily, the crystal of soluble human D1-D4 (237) has 
revealed a homodimeric association between D4 domains of 
CD4. This implies that — as had been proposed by others (241) — 
opposite faces of a CD4 dimer may interact with two separate 
class II molecules. Regardless of the specifics of their dimeric 
interactions, it is reasonable to conclude that multiple surfaces of 
CD4 are responsible for binding, and that the majority of these 
amino acids reside in the Dl domain. 

CONCLUSION 

Although these discussions only scratch the surface of struc- 
ture-function relationships within the immunoglobulin superfam- 
ily of proteins, it is hoped that this chapter has served to introduce 
the reader to the inherent utility — and exquisite beauty — of the 
immunoglobulin domain as both an evolutionary tool and molecu- 
lar motif. While antibody proteins have been structurally charac- 
terized and functionally probed to an unparalleled degree by the 
concerted and persistent efforts of the scientific community, the 
continued emergence of unexpected findings indicates that a great 
wealth of knowledge is yet to be tapped in their inquiry. Moreover, 
the TCR and MHC IgSF proteins occupy an even greater role in 
terms of immune system functioning, and their investigations have 
been fruitful fields for study, as well. Other molecules, like CD4 
and CD8, while not the focus of attention for the prolonged dura- 
tion that has been the case for immunoglobulin, have nonetheless 
seen seminal findings in the pursuit of their understanding, and 
have served to broaden our comprehension of the IgSF's variety 
and capacity. Still others, like the antigen-receptor signaling pro- 
teins Iga, Igp, and CD3 subunits, or the co-stimulatory molecules 
B7-1 (CD80), B7-2 (CD86), CTLA-4, and CD28 have only 
recently been described in detail and will no doubt be centers of 
concentration in the immediate future. Given the unequivocal fact 
that new and exciting IgSF members are yet to be discovered, when 
one considers the pervasiveness of this class of proteins in 



immunology — and in biology as a whole — it is perhaps accurate to 
surmise that the study of the immunoglobulin superfamily is still 
only in its infancy. 
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